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SECRETED PROTEINS 
TECHNICAL FIELD 

The invention relates to novel nucleic acids, secreted proteins encoded by these nucleic acids, 
5 and to the use of these nucleic acids and proteins in the diagnosis, treatment, and prevention of cell 
proliferative, autoimmune/inflammatory, cardiovascular, neurological, and developmental disorders. 
The invention also relates to the assessment of the effects of exogenous compounds on the expression 
of nucleic acids and secreted proteins. 

10 BACKGROUND OF THE INVENTION 

Protein transport and secretion are essential for cellular function. Protein transport is 
mediated by a signal peptide located at the amino terminus of the protein to be transported or secreted. 
The signal peptide is comprised of about ten to twenty hydrophobic amino acids which target the 
nascent protein from the ribosome to a particular membrane bound compartment such as the 

15 endoplasmic reticulum (ER). Proteins targeted to the ER may either proceed through the secretory 
pathway or remain in any of the secretory organelles such as the ER, Golgi apparatus, or lysosomes. 
Proteins that transit through the secretory pathway are either secreted into the extracellular space or 
retained in the plasma membrane. Proteins that are retained in the plasma membrane contain one or 
more transmembrane domains, each comprised of about 20 hydrophobic amino acid residues. 

20 Secreted proteins are generally synthesized as inactive precursors that are activated by post- 

translational processing events during transit through the secretory pathway. Such events include 
glycosylation, proteolysis, and removal of the signal peptide by a signal peptidase. Other events that 
may occur during protein transport include chaperone-dependent unfolding and folding of the nascent 
protein and interaction of the protein with a receptor or pore complex. Examples of secreted proteins 

25 with amino terminal signal peptides are discussed below and include proteins with important roles in 
cell-to-cell signaling. Such proteins include transmembrane receptors and cell surface markers, 
extracellular matrix molecules, cytokines, hormones, growth and differentiation factors, enzymes, 
neuropeptides, vasomediators, cell surface markers, and antigen recognition molecules. (Reviewed in 
Alberts, B. et al. (1994) Molecular Biology of The Cell . Garland Publishing, New York, NY, pp. 557- 

30 560,582-592.) 

Cell surface markers include cell surface antigens identified on leukocytic cells of the immune 
system. These antigens have been identified using systematic, monoclonal antibody (m Ab)-based 
"shot gun" techniques. These techniques have resulted in the production of hundreds of m Abs 
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directed against unknown cell surface leukocytic antigens. These antigens have been grouped into 
"clusters of differentiation" based on common immunocytocheniical localization patterns in various 
differentiated and undifferentiated leukocytic cell types. Antigens in a given cluster are presumed to 
identify a single cell surface protein and are assigned a "cluster of differentiation" or "CD" 

5 designation. Some of the genes encoding proteins identified by CD antigens have been cloned and 
verified by standard molecular biology techniques. CD antigens have been characterized as both 
transmembrane proteins and cell surface proteins anchored to the plasma membrane via covalent 
attachment to fatty acid-containing glycolipids such as glycosylphosphatidylinositol (GP1). (Reviewed 
in Barclay, A.N. et al. (1995) The Leucocyte Antigen Facts Book . Academic Press, San Diego, CA, 

10 pp. 17-20.) 

Matrix proteins (MPs) are transmembrane and extracellular proteins which function in 
formation, growth, remodeling, and maintenance of tissues and as important mediators and regulators 
of the inflammatory response. The expression and balance of MPs may be perturbed by biochemical 
changes that result from congenital, epigenetic, or infectious diseases. In addition, MPs affect 

15 leukocyte migration, proliferation, differentiation, and activation in the immune response. MPs are 
frequently characterized by the presence of one or more domains which may include collagen-like 
domains, EGF-like domains, immunoglobulin-like domains, and fibronectin-like domains. In addition, 
MPs may be heavily glycosylated and may contain an Arginine-Glycine- Aspartate (RGD) tripeptide 
motif which may play a role in adhesive interactions. MPs include extracellular proteins such as 

20 fibronectin, collagen, galectin, vitronectin and its proteolytic derivative somatomedin B; and cell 
adhesion receptors such as cell adhesion molecules (CAMs), cadherins, and integrins. (Reviewed in 
Ayad, S. et al. (1994) The Extracellular Matrix Facts Book , Academic Press, San Diego, CA, pp. 2- 
16; Ruoslahti, E. (1997) Kidney Int. 51:1413-1417; Sjaastad, M.D. and WJ. Nelson (1997) BioEssays 
19:47-55.) 

25 Mucins are highly glycosylated glycoproteins that are the major structural component of the 

mucus gel. The physiological functions of mucins are cytoprotection, mechanical protection, 
maintenance of viscosity in secretions, and cellular recognition. MUC6 is a human gastric mucin that 
is also found in gall bladder, pancreas, seminal vesicles, and female reproductive tract (Toribara, N.W. 
et al. (1997) J. Biol. Chem. 272:16398-16403). The MUC6 gene has been mapped to human 

30 chromosome 1 1 (Toribara, N.W. et al. (1993) J. Biol. Chem. 268:5879-5885). Hemomucin is a novel 
Drosophila surface mucin that may be involved in the induction of antibacterial effector molecules 
(Theopold, U. et al. (1996) J. Biol. Chem. 217:12708-12715). 

Tuftelins are one of four different enamel matrix proteins that have been identified so far. 



2 



WO 03/068943 



PCTAJS03/04712 



The other three known enamel matrix proteins are the amelogenins, enamelin and ameloblastic 
Assembly of the enamel extracellular matrix from these component proteins is believed to be critical in 
producing a matrix competent to undergo mineral replacement (Paine, C.T. et al. (1998) Connect 
Tissue Res. 38:257-267). Tuftelin mRNA has been found to be expressed in human ameloblastoma 
5 tumor, a non-mineralized odontogenic tumor (Deutsch, D. et al. (1998) Connect. Tissue Res. 
39:177-184). 

Olfactomedin-related proteins are extracellular matrix, secreted glycoproteins with conserved 
C-terminal motifs. They are expressed in a wide variety of tissues and in a broad range of species, 
from Caenorhabditis elegans to Homo sapiens. Olfactomedin-related proteins comprise a gene 

10 family with at least 5 family members in humans. One of the five, HGR/myocilin protein, is expressed 
in the eye and is associated with the pathogenesis of glaucoma (Kulkarni, N.H. et al. (2000) Genet. 
Res. 76:41-50). Research by Yokoyama, M. et al. (1996; DNA Res. 3:31 1-320) found a 135-amino 
acid protein, termed AMY, having 96% sequence identity with rat neuronal olfactomedin-releated ER 
localized protein in a neuroblastoma cell line cDNA library, suggesting an essential role for AMY in 

15 nerve tissue. Neuron-specific olfactomedin-related glycoproteins isolated from rat brain cDNA 

libraries show strong sequence similarity with olfactomedin. This similarity is suggestive of a matrix- 
related function of these glycoproteins in neurons and neurosecretory cells (Danielson, P.E. et al. 
(1994) J. Neurosci. Res. 38:468-478). 

Mac-2 binding protein is a 90-kD serum protein (90K), a secreted glycoprotein isolated from 

20 both the human breast carcinoma cell line SK-BR-3, and human breast milk. It specifically binds to a 
human macrophage-associated lectin, Mac-2. Structurally, the mature protein is 567 amino acids in 
length and is proceeded by an 18-amino acid leader. There are 16 cysteines and seven potential re- 
linked glycosylation sites. The first 106 amino acids represent a domain very similar to an ancient 
protein superfamily defined by a macrophage scavenger receptor cysteine-rich domain (Koths, K. et 

25 al. (1993) J. Biol. Chem. 268:14245-14249). 90K is elevated in the serum of subpopulations of AIDS 
patients and is expressed at varying levels in primary tumor samples and tumor cell lines. Ullrich, A. 
et al. (1994; J. Biol. Chem. 269:18401-18407) have demonstrated that 90K stimulates host defense 
systems and can induce interleukin-2 secretion. This immune stimulation is proposed to be a result of 
oncogenic transformation, viral infection or pathogenic invasion (Ullrich et al., supra). 

30 Semaphores are a large group of axonal guidance molecules consisting of at least 30 different 

members and are found in vertebrates, invertebrates, and even certain viruses. All semaphorins 
contain the sema domain which is approximately 500 amino acids in length. Neuropilin, a semaphorin 
receptor, has been shown to promote neurite outgrowth in vitro. The extracellular region of 



3 



WO 03/068943 



PCT/US03/04712 



neuropilins consists of three different domains: CUB, discoidin, and MAM domains. The CUB and 
the MAM motifs of neuropilin have been suggested to have roles in protein-protein interactions and 
are thought to be involved in the binding of semaphorins through the sema and the C-terminal domains 
(reviewed in Raper, J.A. (2000) Curr. Opin. Neurobiol. 10:88-94). Plexins are neuronal cell surface 

5 molecules that mediate cell adhesion via a homophilic binding mechanism in the presence of calcium 
ions. Plexins have been shown to be expressed in the receptors and neurons of particular sensory 
systems (Ohta, K. et al. (1995) Cell 14:1 189-1199). There is evidence that suggests that some plexins 
function to control motor and CNS axon guidance in the developing nervous system. Plexins, which 
themselves contain complete semaphorin domains, may be both the ancestors of classical semaphorins 

10 and binding partners for semaphorins (Winberg, M.L. et a] (1998) Cell 95:903-916). 

Human pregnancy-specific beta 1 -glycoprotein (PSG) is a family of closely related 
glycoproteins of molecular weights of 72 KDa, 64KDa, 62KDa, and 54KDa. Together with the 
carcinoembryonic antigen, they comprise a subfamily within the immunoglobulin superfamily (Plouzek, 
C.A. and J.Y. Chou, (1991) Endocrinology 129550-958) Different subpopulations of PSG have been 

15 found to be produced by the trophoblasts of the human placenta, and the amnionic and chorionic 
membranes (Plouzek, C.A. et al. (1993) Placenta 14:277-285). 

Torsion dystonia is an autosomal dominant movement disorder consisting of involuntary 
muscular contractions. Hie disorder has been linked to a 3-base pair mutation in the DYT-1 gene, 
which encodes torsin A (Ozelius, LJ. et al. (1997) Nat. Genet. 17:40-48). Torsin A bears significant 

20 homology to the HsplOO/Clp family of ATPase chaperones, which are conserved in humans, rats, 
mice, and G elegans. Strong expression of DYT-1 in neuronal processes indicates a potential role 
for torsins in synaptic communication (Kustedjo, K. et al. (2000) 3. Biol. Chem. 275:27933-27939 and 
Konakova M. et al. (2001) Arch. Neurol. 58:921-927). 

Autocrine motility factor (AMF) is one of the motility cytokines regulating tumor cell 

25 migration; therefore identification of the signaling pathway coupled with it has critical importance. 
Autocrine motility factor receptor (AMFR) expression has been found to be associated with tumor 
progression in thymoma (Ohta Y. et al. (2000) Int. J. Oncol. 17:259-264). AMFR is a cell surface 
glycoprotein of molecular weight 78KDa. 

Hormones are secreted molecules that travel through the circulation and bind to specific 

30 receptors on the surface of, or within, target cells. Although they have diverse biochemical 

compositions and mechanisms of action, hormones can be grouped into two categories. One category 
includes small lipophilic hormones that diffuse through the plasma membrane of target cells, bind to 
cytosolic or nuclear receptors, and form a complex that alters gene expression. Examples of these 
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molecules include retinoic acid, thyroxine, and the cholesterol-derived steroid hormones such as 
progesterone, estrogen, testosterone, Cortisol, and aldosterone. The second category includes 
hydrophilic hormones that function by binding to cell surface receptors that transduce signals across 
the plasma membrane. Examples of such hormones include amino acid derivatives such as 

5 catecholamines (epinephrine, norepinephrine) and histamine, and peptide hormones such as glucagon, 
insulin, gastrin, secretin, cholecystokinin, adrenocorticotropic hormone, follicle stimulating hormone, 
luteinizing hormone, thyroid stimulating hormone, and vasopressin. (See, for example, Lodish et al. 
(1995) Molecular Cell Biology . Scientific American Books Inc., New York, NY, pp. 856-864.) 
Proopiomelanocortin (POMC) is the precursor polypeptide of corticotropin (ACTH), a 

10 hormone synthesized by the anterior pituitary gland, which functions in the stimulation of the adrenal 
cortex. POMC is also the precursor polypeptide of the hormone beta-lipotropin (beta-LPH). Each 
hormone includes smaller peptides with distinct biological activities: alpha-melanotropin (alpha-MSH) 
and corticotropin-like intermediate lobe peptide (CUP) are formed from ACTH; gamma-lipotropin 
(gamma-LPH) and beta-endorphin are peptide components of beta-LPH; while beta-MSH is 

15 contained within gamma-LPH. Adrenal insufficiency due to ACTH deficiency, resulting from a 
genetic mutation in exons 2 and 3 of POMC results in an endocrine disorder characterized by early- 
onset obesity, adrenal insufficiency, and red hair pigmentation (Chretien, M. et al. (1979) Can. J. 
Biochem. 57:1 1 1 1-1121 ; Krude, H. et al. (1998) Nat. Genet. 19:155-157; Online Mendelian 
Inheritance in Man (OMIM) 176830). 

20 Growth and differentiation factors are secreted proteins which function in intercellular 

communication. Some factors require oligomerization or association with membrane proteins for 
activity. Complex interactions among these factors and their receptors trigger intracellular signal 
transduction pathways that stimulate or inhibit cell division, cell differentiation, cell signaling, and cell 
motility. Most growth and differentiation factors act on cells in their local environment (paracrine 

25 signaling). There are three broad classes of growth and differentiation factors. The first class 
includes the large polypeptide growth factors such as epidermal growth factor, fibroblast growth 
factor, transforming growth factor, insulin-like growth factor, and platelet-derived growth factor. The 
second class includes the hematopoietic growth factors such as the colony stimulating factors (CSFs). 
Hematopoietic growth factors stimulate the proliferation and differentiation of blood cells such as B- 

30 lymphocytes, T-lymphocytes, erythrocytes, platelets, eosinophils, basophils, neutrophils, macrophages, 
and their stem cell precursors. The third class includes small peptide factors such as bombesin, 
vasopressin, oxytocin, endothelin, transferrin, angiotensin 11, vasoactive intestinal peptide, and 
bradykinin, which function as hormones to regulate cellular functions other than proliferation. 
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Growth and differentiation factors play critical roles in neoplastic transformation of cells in 
vitro and in tumor progression in vivo. Inappropriate expression of growth factors by tumor cells may 
contribute to vascularization and metastasis of tumors. During hematopoiesis, growth factor 
misregulation can result in anemias, leukemias, and lymphomas. Certain growth factors such as 

5 interferon are cytotoxic to tumor cells both in vivo and in vitro. Moreover, some growth factors and 
growth factor receptors are related both structurally and functionally to oncoproteins. In addition, 
growth factors affect transcriptional regulation of both proto-oncogenes and oncosuppressor genes. 
(Reviewed in Pimentel, E. (1994) Handbook of Growth Factors . CRC Press, Ann Arbor, Ml, pp. 1-9.) 
The Slit protein, first identified in Drosophila, is critical in central nervous system midline 

10 formation and potentially in nervous tissue histogenesis and axonal pathfinding. Itoh et al. (1998; Brain 
Res. Mol. Brain Res. 62:175-186) have identified mammalian homologies of the slit gene (human Slit- 
1, Slit-2, Slit-3 and rat Slit-1). The encoded proteins are putative secreted proteins containing EGF-like 
motifs and leucine-rich repeats, both of which are conserved protein-protein interaction domains. Slit- 
1, -2, and -3 mRNAs are expressed in the brain, spinal cord, and thyroid, respectively (ltoh et al., 

15 supra). The Slit family of proteins are indicated to be functional ligands of glypican-1 in nervous 
tissue and it is suggested that their interactions may be critical in certain stages during central nervous 
system histogenesis (Liang, Y. et al. (1999) J. Biol. Chem. 274:17885-17892). 

Neuropeptides and vasomediators (NP/VM) comprise a large family of endogenous signaling 
molecules. Included in this family are neuropeptides and neuropeptide hormones such as bombesin, 

20 neuropeptide Y, neurotensin, neuromedin N, melanocortins, opioids, galanin, somatostatin, tachykinins, 
urotensin D and related peptides involved in smooth muscle stimulation, vasopressin, vasoactive 
intestinal peptide, and circulatory system-borne signaling molecules such as angiotensin, complement, 
calcitonin, endothelins, formyl-methionyl peptides, glucagon, cholecystokinin and gastrin. NP/VMs can 
transduce signals directly, modulate the activity or release of other neurotransmitters and hormones, 

25 and act as catalytic enzymes in cascades. The effects of NP/VMs range from extremely brief to 

long-lasting. (Reviewed in Martin, C.R. et al. (1985) Endocrine Physiology . Oxford University Press, 
New York, NY, pp. 57-62.) 

NP/VMs are involved in numerous neurological and cardiovascular disorders. For example, 
neuropeptide Y is involved in hypertension, congestive heart failure, affective disorders, and appetite 

30 regulation. Somatostatin inhibits secretion of growth hormone and prolactin in the anterior pituitary, as 
well as inhibiting secretion in intestine, pancreatic acinar cells, and pancreatic beta-cells. A reduction 
in somatostatin levels has been reported in Alzheimer's disease and Parkinson's disease. Vasopressin 
acts in the kidney to increase water and sodium absorption, and in higher concentrations stimulates 
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contraction of vascular smooth muscle, platelet activation, and glycogen breakdown in the liver. 
Vasopressin and its analogues are used clinically to treat diabetes insipidus. Endothelin and 
angiotensin are involved in hypertension, and drugs, such as captopril, which reduce plasma levels of 
angiotensin, are used to reduce blood pressure (Watson, S. and S. Arkinstall (1994) The G-protein 

5 Linked Receptor Facts Book . Academic Press, San Diego CA, pp. 194; 252; 284; 55; 111). 

Neuropeptides have also been shown to have roles in nociception (pain). Vasoactive intestinal 
peptide appears to play an important role in chronic neuropathic pain. Nociceptin, an endogenous 
ligand for for the opioid receptor-like 1 receptor, is thought to have a predominantly anti-nociceptive 
effect, and has been shown to have analgesic properties in different animal models of tonic or chronic 

10 pain (Dickinson, T. and S.M. Fleetwood-Walker (1998) Trends Pharmacol. Sci. 19:346-348). 

Other proteins that contain signal peptides include secreted proteins with enzymatic activity. 
Such activity includes, for example, oxidoreductase/dehydrogenase activity, transferase activity, 
hydrolase activity, lyase activity, isomerase activity, or ligase activity. For example, matrix 
metalloproteinases are secreted hydrolytic enzymes that degrade the extracellular matrix and thus play 

15 an important role in tumor metastasis, tissue morphogenesis, and arthritis (Reponen, P. et al. (1995) 
Dev. Dyn. 202:388-396; Firestein, G.S. (1992) Curr. Opin. Rheumatol. 4:348-354; Ray, J.M. and W.G. 
Stetler-Stevenson (1994) Eur. Respir. J. 7:2062-2072; and Mignatti, P. and D.B. Rifkin (1993) Physiol. 
Rev. 73:161-195). Additional examples are the acetyl-CoA synthetases which activate acetate for 
use in lipid synthesis or energy generation (Luong, A. et al. (2000) J. Biol. Chem. 275:26458-26466). 

20 The result of acetyl-CoA synthetase activity is the formation of acetyl-CoA from acetate and CoA. 
Acetyl-CoA sythetases share a region of sequence similarity identified as the AMP-binding domain 
signature. Acetyl-CoA synthetase has been shown to be associated with hypertension (Toh, H. 
(1991) Protein Seq. Data Anal. 4:1 1 1-1 17; and Iwai, N. et al. (1994) Hypertension 23:375-380). 
A number of isomerases catalyze steps in protein folding, phototransduction, and various 

25 anabolic and catabolic pathways. One class of isomerases is known as peptidyl-prolyl cis-trans 

isomerases (PPIases). PPlases catalyze the cis to trans isomerization of certain proline imidic bonds 
in proteins. Two families of PPlases are the FK506 binding proteins (FKBPs), and cyclophilins 
(CyPs). FKBPs bind the potent immunosuppressants FK506 and rapamycin, thereby inhibiting 
signaling pathways in T-cells. Specifically, the PPIase activity of FKBPs is inhibited by binding of 

30 FK506 or rapamycin. There are five members of the FKBP family which are named according to 
their calculated molecular masses (FKBP12, FKBP13, FKBP25, FKBP52, and FKBP65), and 
localized to different regions of the cell where they associate with different protein complexes (Coss, 
M. et al. (1995) J. Biol. Chem. 27029336-29341 ; Schreiber, S.L. (1991) Science 251:283-287). 
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The peptidyl-prolyl isomerase activity of CyP may be part of the signaling pathway that leads 
to T-cell activation. CyP isomerase activity is associated with protein folding and protein trafficking, 
and may also be involved in assembly/disassembly of protein complexes and regulation of protein 
activity. For example, in Drosophila> the CyP NinaA is required for correct localization of 

5 rhodopsins, while a mammalian CyP (Cyp40) is part of the Hsp90/Hsc70 complex that binds steroid 
receptors. The mammalian CypA has been shown to bind the gag protein from human 
immunodeficiency virus 1 (HIV-1), an interaction that can be inhibited by cyclosporin. Since 
cyclosporin has potent anti-HIV-1 activity, CypA may play an essential function in HIV-l replication. 
Finally, Cyp40 has been shown to bind and inactivate the transcription factor c-Myb, an effect that is 

10 reversed by cyclosporin. This effect implicates CyPs in the regulation of transcription, transformation, 
and differentiation (Bergsma, DJ. et al (1991) J. Biol. Chem. 266:23204-23214; Hunter, T. (1998) Cell 
92:141-143; and Leverson, J.D. and S.A. Ness, (1998) Mol. Cell. 1:203-211). 

Gamma-carboxyglutamic acid (Gla) proteins rich in proline (PRGPs) are members of a family 
of vitamin K-dependent single-pass integral membrane proteins. These proteins are characterized by 

15 an extracellular amino terminal domain of approximately 45 amino acids rich in Gla. The intracellular 
carboxyl terminal region contains one or two copies of the sequence PPXY, a motif present in a 
variety of proteins involved in such diverse cellular functions as signal transduction, cell cycle 
progression, and protein turnover (Kulman, J.D. et al. (2001) Proc. Natl. Acad. Sci. USA 98:1370- 
1 375). The process of post-translational modification of glutamic residues to form Gla is Vitamin K- 

20 dependent carboxylation. Proteins which contain Gla include plasma proteins involved in blood 
coagulation. These proteins are prothrombin, proteins C, S, and Z, and coagulation factors VII, IX, 
and X. Osteocalcin (bone-Gla protein, BGP) and matrix Gla-protein (MGP) also contain Gla 
(Friedman, P.A. and C.T. Przysiecki (1987) Int. J. Biochem. 19:1-7; Vermeer, C. (1990) Biochem. J. 
266:625-636). 

25 Immunoglobulins 

Antigen recognition molecules are key players in the sophisticated and complex immune 
systems which all vertebrates have developed to provide protection from viral, bacterial, fungal, and 
parasitic infections. A key feature of the immune system is its ability to distinguish foreign molecules, 
or antigens, from "self * molecules. This ability is mediated primarily by secreted and transmembrane 

30 proteins expressed by leukocytes (white blood cells) such as lymphocytes, granulocytes, and 

monocytes. Most of these proteins belong to the immunoglobulin (lg) superfamily, members of which 
contain one or more repeats of a conserved structural domain. This lg domain is comprised of 
antiparallel P sheets joined by a disulfide bond in an arrangement called the lg fold. The criteria for a 
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protein to be a member of the Ig superfamily is to have one or more Ig domains, which are regions of 
70-1 10 amino acid residues in length homologous to either Ig variable-like (V) or lg constant-like (C) 
domains. Members of the lg superfamily include antibodies (Ab), T cell receptors (TCRs), class 1 and 
II major histocompatibility (MHC) proteins and immune cell-specific surface markers such as the 
5 "cluster of differentiation" or CD antigens, CD2, CD3, CD4, CD8, poly-lg receptors, Fc receptors, 
neural cell-adhesion molecule (NCAM) and platelet-derived growth factor receptor (PDGFR). 

lg domains (V and C) are regions of conserved amino acid residues that give a polypeptide a 
globular tertiary structure called an immunoglobulin (or antibody) fold, which consists of two 
approximately parallel layers of P-sheets. Conserved cysteine residues form an intrachain disulfide- 

10 bonded loop, 55-75 amino acid residues in length, which connects the two layers of P-sheets. Each P~ 
sheet has three or four anti-parallel P-strands of 5-10 amino acid residues. Hydrophobic and 
hydrophilic interactions of amino acid residues within the P-strands stabilize the lg fold (hydrophobic 
on inward facing amino acid residues and hydrophilic on the amino acid residues in the outward facing 
portion of the strands). A V domain consists of a longer polypeptide than a C domain, with an 

15 additional pair of P-strands in the lg fold. 

A consistent feature of Ig superfamily genes is that each sequence of an lg domain is 
encoded by a single exon. It is possible that the superfamily evolved from a gene coding for a single 
lg domain involved in mediating cell-cell interactions. New members of the superfamily then arose by 
exon and gene duplications. Modem lg superfamily proteins contain different numbers of V and/or C 

20 domains. Another evolutionary feature of this superfamily is the ability to undergo DNA 
rearrangements, a unique feature retained by the antigen receptor members of the family. 

Many members of the Ig superfamily are integral plasma membrane proteins with 
extracellular lg domains. The hydrophobic amino acid residues of their transmembrane domains and 
their cytoplasmic tails are very diverse, with little or no homology among Ig family members or to 

25 known signal-transducing structures. There are exceptions to this general superfamily description. 
For example, the cytoplasmic tail of PDGFR has tyrosine kinase activity. In addition Thy-1 is a 
glycoprotein found on thymocytes and T cells. This protein has no cytoplasmic tail, but is instead 
attached to the plasma membrane by a covalent glycophosphatidylinositol linkage. 

Another common feature of many lg superfamily proteins is the interactions between lg 

30 domains which are essential for the function of these molecules. Interactions between lg domains of 
a multimeric protein can be either homophiUc or heterophilic (i.e., between the same or different lg 
domains). Antibodies are multimeric proteins which have both hemophilic and heterophilic interactions 
between lg domains. Pairing of constant regions of heavy chains forms the Fc region of an antibody 
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and pairing of variable regions of light and heavy chains form the antigen binding site of an antibody. 
Heterophil^ interactions also occur between lg domains of different molecules. These interactions 
provide adhesion between cells for significant cell-cell interactions in the immune system and in the 
developing and mature nervous system. (Reviewed in Abbas, A.K. et al. (1991) Cellular and 
5 Molecular Immunology . W.B. Saunders Company, Philadelphia, PA, pp. 142-145.) 
Antibodies 

MHC proteins are cell surface markers that bind to and present foreign antigens to T cells. 
MHC molecules are classified as either class 1 or class U. Class I MHC molecules (MHC I) are 
expressed on the surface of almost all cells and are involved in the presentation of antigen to cytotoxic 

10 T cells. For example, a cell infected with virus will degrade intracellular viral proteins and express the 
protein fragments bound to MHC 1 molecules on the cell surface. The MHC 1/antigen complex is 
recognized by cytotoxic T-cells which destroy the infected cell and the virus within. Class 11 MHC 
molecules are expressed primarily on specialized antigen-presenting cells of the immune system, such 
as B-cells and macrophages. These cells ingest foreign proteins from the extracellular fluid and 

15 express MHC D/antigen complex on the cell surface. This complex activates helper T-cells, which 
then secrete cytokines and other factors that stimulate the immune response. MHC molecules also 
play an important role in organ rejection following transplantation. Rejection occurs when the 
recipient's T-cells respond to foreign MHC molecules on the transplanted organ in the same way as to 
self MHC molecules bound to foreign antigen. (Reviewed in Alberts et al., supra, pp. 1229-1246.) 

20 Antibodies are multimeric members of the Ig superfamily which are either expressed on the 

surface of B-cells or secreted by B-cells into the circulation. Antibodies bind and neutralize foreign 
antigens in the blood and other extracellular fluids. The prototypical antibody is a tetramer consisting 
of two identical heavy polypeptide chains (H-chains) and two identical light polypeptide chains (L- 
chains) interlinked by disulfide bonds. This arrangement confers the characteristic Y-shape to 

25 antibody molecules. Antibodies are classified based on their H-chain composition. The five antibody 
classes, IgA, IgD, IgE, IgG and IgM, are defined by the a, 8, e, 7, and \i H-chain types. There are 
two types of L-chains, k and X, either of which may associate as a pair with any H-chain pair. IgG, 
the most common class of antibody found in the circulation, is tetrameric, while the other classes of 
antibodies are generally variants or muhimers of this basic structure. 

30 H-chains and l-chains each contain an N-terminal variable region and a C-terminal constant 

region. The constant region consists of about 1 10 amino acids in L-chains and about 330 or 440 amino 
acids in H-chains. The amino acid sequence of the constant region is nearly identical among H- or L- 
chains of a particular class. The variable region consists of about 110 amino acids in both H- and L- 



10 



WO 03/068943 



PCT/US03/04712 



chains. However, the amino acid sequence of the variable region differs among H- or L-chains of a 
particular class. Within each H- or L-chain variable region are three hypervariable regions of 
extensive sequence diversity, each consisting of about 5 to 10 amino acids. In the antibody molecule, 
the H- and L-chain hypervariable regions come together to form the antigen recognition site. 

5 (Reviewed in Alberts et al. supra, pp. 1206-1213; 1216-1217.) 

Both H-chains and L-chains contain the repeated Ig domains of members of the Ig 
superfamily. For example, a typical H-chain contains four lg domains, three of which occur within the 
constant region and one of which occurs within the variable region and contributes to the formation of 
the antigen recognition site. Likewise, a typical L-chain contains two lg domains, one of which occurs 

10 within the constant region and one of which occurs within the variable region. 

The immune system is capable of recognizing and responding to any foreign molecule that 
enters the body. Therefore, the immune system must be armed with a fiill repertoire of antibodies 
against all potential antigens. Such antibody diversity is generated by somatic rearrangement of gene 
segments encoding variable and constant regions. These gene segments are joined together by site- 

15 specific recombination which occurs between highly conserved DNA sequences that flank each gene 
segment. Because there are hundreds of different gene segments, millions of unique genes can be 
generated combinatorially. In addition, imprecise joining of these segments and an unusually high rate 
of somatic mutation within these segments further contribute to the generation of a diverse antibody 
population. 

20 Ex pression profiling 

Microarrays are analytical tools used in bioanalysis. A microarray has a plurality of molecules 
spatially distributed over, and stably associated with, the surface of a solid support. Microarrays of 
polypeptides, polynucleotides, and/or antibodies have been developed and find use in a variety of 
applications, such as gene sequencing, monitoring gene expression, gene mapping, bacterial 

25 identification, drug discovery, and combinatorial chemistry. 

One area in particular in which microarrays find use is in gene expression analysis. Array 
technology can provide a simple way to explore the expression of a single polymorphic gene or the 
expression profile of a large number of related or unrelated genes. When the expression of a single 
gene is examined, arrays are employed to detect the expression of a specific gene or its variants. 

30 When an expression profile is examined, arrays provide a platform for identifying genes that are tissue 
specific, are affected by a substance being tested in a toxicology assay, are part of a signaling 
cascade, carry out housekeeping functions, or are specifically related to a particular genetic 
predisposition, condition, disease, or disorder. 
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Breast Cancer 

There are more than 180,000 new cases of breast cancer diagnosed each year, and the 
mortality rate for breast cancer approaches 10% of all deaths in females between the ages of 45-54 
(K. Gish (1999) AW1S Magazine 28:7-10). However the survival rate based on early diagnosis of 

5 localized breast cancer is extremely high (97%), compared with the advanced stage of the disease in 
which the tumor has spread beyond the breast (22%). Current procedures for clinical breast 
examination are lacking in sensitivity and specificity, and efforts are underway to develop 
comprehensive gene expression profiles for breast cancer that may be used in conjunction with 
conventional screening methods to improve diagnosis and prognosis of this disease (Perou CM et al. 

10 (2000) Nature 406:747-752). 

Breast cancer is a genetic disease commonly caused by mutations in cellular disease. 
Mutations in two genes, BRCA1 and BRCA2, are known to greatly predispose a woman to breast 
cancer and may be passed on from parents to children (Gish, supra ). However, this type of hereditary 
breast cancer accounts for only about 5% to 9% of breast cancers, while the vast majority of breast 

15 cancer is due to nonihherited mutations that occur in breast epithelial cells. 

A good deal is already known about the expression of specific genes associated with breast 
cancer. For example, the relationship between expression of epidermal growth factor (EGF) and its 
receptor, EGFR, to human mammary carcinoma has been particularly well studied. (See Khazaie et 
a U supra , and references cited therein for a review of this area.) Overexpression of EGFR, 

20 particularly coupled with down-regulation of the estrogen receptor, is a marker of poor prognosis in 
breast cancer patients. In addition, EGFR expression in breast tumor metastases is frequently 
elevated relative to the primary tumor, suggesting that EGFR is involved in tumor progression and 
metastasis. This is supported by accumulating evidence that EGF has effects on cell functions related 
to metastatic potential, such as cell motility, chemotaxis, secretion and differentiation. Changes in 

25 expression of other members of the erbB receptor family, of which EGFR is one, have also been 
implicated in breast cancer. The abundance of erbB receptors, such as HER-2/neu, HER-3, and 
HER-4, and their ligands in breast cancer points to their functional importance in the pathogenesis of 
the disease, and may therefore provide targets for therapy of the disease (Bacus, SS et al. (1994) Am 
J Clin Pathol 102:S13-S24). Other known markers of breast cancer include a human secreted frizzled 

30 protein mRNA that is downregulated in breast tumors; the matrix Gla protein which is overexpressed 
is human breast carcinoma cells; Drgl or RTP, a gene whose expression is diminished in colon, breast, 
and prostate tumors; maspin, a tumor suppressor gene downregulated in invasive breast carcinomas; 
and CaN19, a member of the S100 protein family, all of which are down regulated in mammary 
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carcinoma cells relative to normal mammary epithelial cells (Zhou Z et al. (1998) lnt J Cancer 78:95- 
99; Chen, L et al. (1990) Oncogene 5:1391-1395; Ulrix W et al (1999) FEBS Lett 455:23-26; Sager, R 
et al. (1996) Curr Top Microbiol Immunol 213:51-64; and Lee, SW et al. (1992) Proc Natl Acad Sci 
USA 89:2504-2508). 

5 Cell lines derived from human mammary epithelial cells at various stages of breast cancer 

provide a useful model to study the process of malignant transformation and tumor progression as it 
has been shown that these cell lines retain many of the properties of their parental tumors for lengthy 
culture periods (Wistuba II et al (1998) Clin Cancer Res 4:2931-2938). Such a model is particularly 
useful for comparing phenotypic and molecular characteristics of human mammary epithelial cells at 

10 various stages of malignant transformation. 
Colon Cancer 

Colorectal cancer is the second leading cause of cancer deaths in the United States. Colon 
cancer is associated with aging, since 90% of the total cases occur in individuals over the age of 55. 
A widely accepted hypothesis is that several contributing genetic mutations must accumulate over time 

15 in an individual who develops the disease. To understand the nature of genetic alterations in colorectal 
cancer, a number of studies have focused on the inherited syndromes. The first known inherited 
syndrome, Familial Adenomatous Polyposis (FAP), is caused by mutations in the Adenomatous 
Polyposis Coli gene (APC), resulting in truncated or inactive forms of the protein. This tumor 
suppressor gene has been mapped to chromosome 5q. The second known inherited syndrome is 

20 hereditary nonpolyposis colorectal cancer (HNPCC), which is caused by mutations in mismatch repair 
genes. 

Although hereditary colon cancer syndromes occur in a small percentage of the population 
and most colorectal cancers are considered sporadic, knowledge from studies of the hereditary 
syndromes can be generally applied. For instance, somatic mutations in APC occur in at least 80% of 

25 indiscriminate colon tumors. APC mutations are thought to be the initiating event in the disease. 
Other mutations occur subsequently. Approximately 50% of colorectal cancers contain activating 
mutations in ras, while 85% contain inactivating mutations in p53. Changes in these genes lead to 
gene expression changes in colon cancer. Less is understood about downstream targets of these 
mutations and the role they may play in cancer development and progression. 

30 Tangier Disease 

Tangier disease (TD) is a genetic disorder characterized by near absence of circulating high 
density lipoprotein (HDL) and the accumulation of cholesterol esters in many tissues, including tonsils, 
lymph nodes, liver, spleen, thymus, and intestine. Low levels of HDL represent a clear predictor of 
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premature coronary artery disease and homozygous TD correlates with a four- to six-fold increase in 
cardiovascular disease compared to controls. HDL plays a cardioprotective role in reverse 
cholesterol transport, the flux of cholesterol from peripheral cells such as tissue macrophages through 
plasma lipoproteins to the liver. The HDL protein, apolipoprotein A-I, plays a major role in this 
5 process, interacting with the cell surface to remove excess cholesterol and phospholipids. This 

pathway is severely impaired in TD and the defect lies in a specific gene, the ABC1 transporter. This 
gene is a member of the family of ATP-binding cassette transporters, which utilize ATP hydrolysis to 
transport a variety of substrates across membranes. 
Cell lines 

10 Cell lines derived from human mammary epithelial cells at various stages of breast cancer 

provide a useful model to study the process of malignant transformation and tumor progression as it 
has been shown that these cell lines retain many of the properties of their parental tumors for lengthy 
culture periods (Wistuba, LI. et al. (1998) Clin. Cancer Res. 4:2931-2938). Such a model is 
particularly useful for comparing phenotypic and molecular characteristics of human mammary 

15 epithelial cells at various stages of malignant transformation. 

The human C3A cell line is a clonal derivative of HepG2/C3 (hepatoma cell line, isolated from 
a 15-year-old male with liver tumor), which was selected for strong contact inhibition of growth. The 
use of a clonal population enhances the reproducibility of the cells. C3 A cells have many 
characteristics of primary human hepatocytes in culture: i) expression of insulin receptor and insulin- 

20 like growth factor 11 receptor; ii) secretion of a high ratio of serum albumin compared with a- 
fetoprotein iii) conversion of ammonia to urea and gfutamine; iv) metabolism of aromatic amino 
acids; and v) proliferation in glucose-free and insulin-free medium. The C3A cell line is now well 
established as an in vitro model of the mature human liver (Mickelson et al. (1995) Hepatology 22:866- 
875; Nagendra et al. (1997) Am J Physiol 272:G408-G416). 

25 Gemfibrozil is a fibric acid antilipemic agent that lowers serum triglycerides and produces 

favorable changes in lipoproteins. Gemfibrozil is effective in reducing the risk of coronary heart 
disease in men (Frick, M.H., et al. (1987) New Engl. J. Med; 317:1237-1245). The compound can 
inhibit peripheral lipolysis and decrease hepatic extraction of free fatty acids, which, decreases hepatic 
triglyceride production. Gemfibrozil also inhibits the synthesis and increases the clearance of 

30 apolipoprotein B, a carrier molecule for VLDL. Gemfibrozil has variable effects on LDL cholesterol. 
Although it causes moderate reductions in patients with type Ila hyperlipoproteinemia, changes in 
patients with either type lib or type IV hyperlipoproteinemia are unpredictable. In general, the HMG- 
CoA reductase inhibitors are more effective than gemfibrozil in reducing LDL cholesterol. At the 
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molecular level gemfibozil may function as a peroxisome proliferator-activated receptor (PPAR) 
agonist. Gemfibrozil is rapidly and completely absorbed from the GI tract and undergoes 
enterohepatic recirculation. Gemfibrozil is metabolized by the liver and excreted by the kidneys, 
mainly as metabolites, one of which possesses pharmacologic activity. Gemfibozil causes peroxisome 

5 proliferation and hepatocarcinogenesis in rats, which is a cause for concern generally for fibric acid 
derivative drugs. In humans, fibric acid derivatives are known to increase the risk of gall bladder 
disease although gemfibrozil is better tolerated than other fibrates. The relative safety of gemfibrozil 
in humans compared to rodent species including rats may be attributed to differences in metabolism 
and clearance of the compound in different species (Dix, K.J., et al., (1999) Drug Metab. Distrib. 27 

10 (1) 138-146; Thomas, B.F., et al., (1999) Drug Metab. Distrib. 27 (1) 147-157). 
Lung cancer 

Lung cancer is the leading cause of cancer death for men and the second leading cause of 
cancer death for women in the U.S. Lung cancers are divided into four histopathologically distinct 
groups. Three groups (squamous cell carcinoma, adenocarcinoma, and large cell carcinoma) are 

15 classified as non-small cell lung cancers (NSCLCs). The fourth group of cancers is referred to as 
small cell lung cancer (SCLC). Deletions on chromosome 3 are common in this disease and are 
thought to indicate the presence of a tumor suppressor gene in this region. Activating mutations in K- 
ras are commonly found in lung cancer and are the basis of one of the mouse models for the disease. 
In nearly 80% of patients diagnosed with lung cancer, metastasis has already occurred. Most 

20 commonly hing cancers metastasize to pleura, brain, bone, pericardium, and liver. This adversely 
affects the overall five-year survival rate which is 37% for squamous carcinoma, 27% for 
adenocarcinoma and large cell carcinoma, and less than 1% for small cell carcinomas. Earlier 
diagnosis and an systematic approach to identification, staging, and treatment could positively affect 
patient outcome (DeVita et al. (1997) Cancer: Principles and Practice of Oncology . Lippincott- 

25 Raven, Philadelphia PA) and Fauci et al. (1998) Harrisons Principals of Internal Medicine . McGraw 
Hill, New York, NY). 

Lung cancers progress through a series of morphologically distinct stages from hyperplasia to 
invasive carcinoma. Malignant hing cancers are divided into two groups comprising four 
histopathological classes. The nonsmall cell hing carcinoma (NSCLC) group includes squamous cell 
30 carcinomas, adenocarcinomas, and large cell carcinomas and accounts for about 70% of all lung 
cancer cases. Adenocarcinomas typically arise in the peripheral airways and often form mucin 
secreting glands. Squamous cell carcinomas typically arise in proximal airways. The histogenesis of 
squamous cell carcinomas may be related to chronic inflammation and injury to the bronchial 

15 
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epithelium, leading to squamous metaplasia. The small cell lung carcinoma (SCLC) group accounts 
for about 20% of lung cancer cases. SCLCs typically arise in proximal airways and exhibit a number 
of paraneoplastic syndromes including inappropriate production of adrenocorticotropin and anti-diuretic 
hormone. 

5 Lung cancer cells accumulate numerous genetic lesions, many of which are associated with 

cytologically visible chromosomal aberrations. The high frequency of chromosomal deletions 
associated with lung cancer may reflect the role of multiple tumor suppressor loci in the etiology of this 
disease. Several studies report deletions of regions of chromosome 1 1 in NSCLC (Bepler, G. and 
Garcia-Blanco, M A. (1994) Proc. Natl. Acad. Sci. USA 91:5513-7; lizuka, M., et al. (1995) Genes, 

10 Chromosomes & Cancer 13:40-46; Rasio, D. (1995) Cancer Research 55:3988-91 ). Deletions in 
other chromosome arms such as 3p, 9p and 17p are also common. Other frequently observed genetic 
lesions include overexpression of telomerase, activation of oncogenes such as K-ras and c-myc, and 
inactivation of tumor suppressor genes such as RB, p53 and pl6 (Toomey, D. et al.(2001) Cancer 
92:2648-57; Zajac-Kaye M. (2001) Lung Cancer 34:S43-6; Wright, G. et al. (2000) Current Opinion in 

15 Oncology 12:143-8; Kohno, T. and Yokota, J. (1999) Carcinogenesis 20:1403-10). 
Ovarian Cancer 

Ovarian cancer is the leading cause of death from a gynecologic cancer. The majority of 
ovarian cancers are derived from epithelial cells, and 70% of patients with epithelial ovarian cancers 
present with late-stage disease. As a result, the long-term survival rates for this disease is very low. 

20 Identification of early-stage markers for ovarian cancer would significantly increase the survival rate. 
The molecular events that lead to ovarian cancer are poorly understood. Some of the known 
aberrations include mutation of p53 and microsatellite instability. Since gene expression patterns are 
likely to vary when normal ovary is compared to ovarian tumors, examination of gene expression in 
these tissues to identify possible markers for ovarian cancer is particularly relevant to improving 

25 diagnosis, prognosis, and treatment of this disease. 

Prostate Cancer 

Prostate cancer is a common malignancy in men over the age of 50, and the incidence 
increases with age. In the US, there are approximately 132,000 newly diagnosed cases of prostate 
30 cancer and more than 33,000 deaths from the disorder each year. 

Once cancer cells arise in the prostate, they are stimulated by testosterone to a more rapid 
growth. Thus, removal of the testes can indirectly reduce both rapid growth and metastasis of the 
cancer. Over 95 percent of prostatic cancers are adenocarcinomas which originate in tlie prostatic 
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acini. The remaining 5 percent are divided between squamous cell and transitional cell carcinomas, 
both of which arise in the prostatic ducts or other parts of the prostate gland. 

As with most cancers, prostate cancer develops through a multistage progression ultimately 
resulting in an aggressive, metastatic phenotype. The initial step in tumor progression involves tl*e 

5 hyperproliferation of normal luminal and/or basal epithelial cells that become hyperplastic and evolve 
into early-stage tumors. The early-stage tumors are localized in the prostate but eventually may 
metastasize, particularly to the bone, brain or lung. About 80% of these tumors remain responsive to 
androgen treatment, an important hormone controlling the growth of prostate epithelial cells. 
However, in its most advanced state, cancer growth becomes androgen-independent and there is 

10 currently no known treatment for this condition. 

A primary diagnostic marker for prostate cancer is prostate specific antigen (PSA). PSA is a 
tissue-specific serine protease almost exclusively produced by prostatic epithelial cells. The quantity 
of PSA correlates with the number and volume of the prostatic epithelial cells, and consequently, the 
levels of PSA are an excellent indicator of abnormal prostate growth. Men with prostate cancer 

15 exhibit an early linear increase in PSA levels followed by an exponential increase prior to diagnosis. 
However, since PSA levels are also influenced by factors such as inflammation, androgen and other 
growth factors, some scientists maintain that changes in PSA levels are not useful in detecting 
individual cases of prostate cancer. 

Current areas of cancer research provide additional prospects for markers as well as potential 

20 therapeutic targets for prostate cancer. Several growth factors have been shown to play a critical role 
in tumor development, growth, and progression. The growth factors Epidermal Growth Factor (EGF), 
Fibroblast Growth Factor (FGF), and Tumor Growth Factor alpha (TGFa) are important in the growth 
of normal as well as hyperproliferative prostate epithelial cells, particularly at early stages of tumor 
development and progression, and affect signaling pathways in these cells in various ways (Lin J et al. 

25 (1999) Cancer Res. 59:2891-2897; Putz T et al. (1999) Cancer Res 59:227-233). The TGF-p family 
of growth factors are generally expressed at increased levels in human cancers and the high 
expression levels in many cases correlates with advanced stages of malignancy and poor survival 
(Gold LI (1999) Crit Rev Oncog 10:303-360). Finally, there are human cell lines representing both tlie 
androgen-dependent stage of prostate cancer (LNCap) as well as the androgen-independent, hormone 

30 refractory stage of the disease (PC3 and DU-145) that have proved useful in studying gene 
expression patterns associated with the progression of prostate cancer, and the effects of cell 
treatments on these expressed genes (Chung TD (1999) Prostate 15:199-207). 
Adipocyte Differentiation 
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The potential application of gene expression profiling is relevant to improving diagnosis, 
prognosis, and treatment of disease. For example, both the levels and sequences expressed in tissues 
from subjects with obesity or type II diabetes may be compared with the levels and sequences 
expressed in normal tissue. 

5 The primary function of adipose tissue is the ability to store and release fat during periods of 

feeding and fasting. White adipose tissue is the major energy reserve in periods of fasting, and its 
reserve is mobilized during energy deprivation. Adipose tissue is one of the primary target tissues for 
insulin, and adipogenesis and insulin resistance are linked in type II diabetes, non-insulin dependent 
diabetes mellitus (NIDDM). Cytologicaliy the conversion of a preadipocytes into mature adipocytes is 

10 characterized by deposition of fat droplets around the nuclei. The conversion process in vivo can be 
induced by thiazolidinediones (TZDs) and other PPARy agonists (Adams et al. (1997) J. Clin. Invest. 
100:3149*3153) which also lead to increased sensitivity to insulin and reduced plasma glucose and 
blood pressure. 

Thiazolidinediones (TZDs) act as agonists for the peroxisome-proliferator-acti vated receptor 

15 gamma (PPARy), a member of the nuclear hormone receptor superfamily. TZDs reduce 

hyperglycemia, hyperinsulinemia, and hypertension, in part by promoting glucose metabolism and 
inhibiting gluconeogenesis. Roles for PPARy and its agonists have been demonstrated in a wide range 
of pathological conditions including diabetes, obesity, hypertension, atherosclerosis, polycystic ovarian 
syndrome, and cancers such as breast, prostate, liposarcoma, and colon cancer. 

20 The mechanism by which TZDs and other PPARy agonists enhance insulin sensitivity is not 

fully understood, but may involve the ability of PPARy to promote adipogenesis. When ectopically 
expressed in cultured preadipocytes, PPARy is a potent inducer of adipocyte differentiation. TZDs, in 
combination with insulin and other factors, can also enhance differentiation of human preadipocytes in 
culture (Adams et al. (1997) J. Clin. Invest. 1003149-3153). The relative potency of different TZDs 

25 in promoting adipogenesis in vitro is proportional to both their insulin sensitizing effects in vivo , and 
their ability to bind and activate PPARy in vitro . Interestingly, adipocytes derived from omental 
adipose depots are refractory to the effects of TZDs. It has therefore been suggested that the insulin 
sensitizing effects of TZDs may result from their ability to promote adipogenesis in subcutaneous 
adipose depots (Adams et al., ibid ). Further, dominant negative mutations in the PPARy gene have 

30 been identified in two non-obese subjects with severe insulin resistance, hypertension, and overt non- 
insulin dependent diabetes mellitus (NIDDM) (Barroso et al. (1998) Nature 402:880-883). 

NIDDM is the most common form of diabetes mellitus, a chronic metabolic disease that 
affects 143 million people worldwide. NIDDM is characterized by abnormal glucose and lipid 
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metabolism that result from a combination of peripheral insulin resistance and defective insulin 
secretion. N1DDM has a complex, progressive etiology and a high degree of heritability. Numerous 
complications of diabetes including heart disease, stroke, renal failure, retinopathy, and peripheral 
neuropathy contribute to the high rate of morbidity and mortality. 

5 At the molecular level, PPARy functions as a ligand activated transcription factor. In the 

presence of ligand, PPARy forms a heterodimer with the retinoid X receptor (RXR) which then 
activates transcription of target genes containing one or more copies of a PPARy response element 
(PPRE). Many genes important in lipid storage and metabolism contain PPREs and have been 
identified as PPARy targets, including PEPCK, aP2, LPL, ACS, and FAT-P (Auwerx, J. (1999) 

10 Diabetologia 42:1033-1049). Multiple ligand s for PPARy have been identified. These include a 
variety of fatty acid metabolites; synthetic drugs belonging to the TZD class, such as Pioglitazone and 
Rosiglitazone (BRL49653); and certain non-glitazone tyrosine analogs such as G1262570 and 
GW1929. Hie prostaglandin derivative 15-dPGJ2 is a potent endogenous ligand for PPARy. 

Expression of PPARy is very high in adipose but barely detectable in skeletal muscle, the 

15 primary site for insulin stimulated glucose disposal in the body. PPARy is also moderately expressed 
in large intestine, kidney, liver, vascular smooth muscle, hematopoietic cells, and macrophages. The 
high expression of PPARy in adipose suggests that the insulin sensitizing effects of TZDs may result 
from alterations in the expression of one or more PPARy regulated genes in adipose tissue. 
Identification of PPARy target genes will contribute to better drug design and the development of 

20 novel therapeutic strategies for diabetes, obesity, and other conditions. 

Systematic attempts to identify PPARy target genes have been made in several rodent models 
of obesity and diabetes (Suzuki et al. (2000) Jpn. J. Pharmacol. 84:1 13-123; Way et al. (2001) 
Endocrinology 142:1269-1277). However, a serious drawback of the rodent gene expression studies is 
that significant differences exist between human and rodent models of adipogenesis, diabetes, and 

25 obesity (Taylor (1999) Cell 97:9-12; Gregoire et al. (1998) Physiol. Reviews 78:783-809). Therefore, 
an unbiased approach to identifying TZD regulated genes in primary cultures of human tissues is 
necessary to fully elucidate the molecular basis for diseases associated with PPARy activity. 

The majority of research in adipocyte biology to date has been done using transformed mouse 
preadipocyte cell lines. The culture condition, which stimulates mouse preadipocyte differentiation is 

30 different from that for inducing human primary preadipocyte differentiation. In addition, primary cells 
are diploid and may therefore reflect the in vivo context better than aneuploid cell lines. 
Understanding the gene expression profile during adipogenesis in human will lead to understanding the 
fundamental mechanism of adiposity regulation. Furthermore, through comparing the gene expression 
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profiles of adi pogenesis between donor with normal weight and donor with obesity, identification of 
crucial genes, potential drug targets for obesity and type 11 diabetes, will be possible. 
Dendritic Cells 

Dendritic cells (DC) are antigen presenting cells (APC) that play a key role in the primary 

5 immune response because of their unique ability to present antigens to naive T cells. In addition, DC 
differentiate into separate subsets that sustain and regulate immune responses following initial contact 
with antigen. DC subsets include those that preferentially induce particular T helper 1 (Thl) or T 
helper 2 (Th2) responses and those that regulate B cell responses. Moreover, DC are increasingly 
being used to manipulate immune responses, either to downregulate an aberrant autoimmune response 

10 or to enhance vaccination or a tumor-specific response. 

DC are functionally specialized in correlation with their particular differentiation state. CD34+ 
myeloid cells found in the bone marrow mature in response to as yet unclear signals into CD 14+ 
CD1 lc+ monocytes. An innate or antigen non-specific response takes place initially when monocytes 
circulate to nonlymphoid tissues and respond to lipopolysaccharide (LPS), a bacterially-derived 

15 mitogen, and viruses. Such direct encounter with antigen causes secretion of pro-inflammatory 

cytokines that attract and regulate natural killer cells, macrophages, and eosinophils in the first line of 
defense against invading pathogens. Monocytes then mature into DC, which capture antigen highly 
efficiently through endocytosis and antigen receptor uptake. Antigen processing and presentation 
trigger activation and differentiation into mature DC that express MHC class II molecules on the cell 

20 surface and efficiently activate T cells, initiating antigen-specific T cell and B cell responses. In turn, 
T cells activate DC through CD40 ligand - CD40 interactions, which stimulate expression of the 
costimulatory molecules CD80 and CD86, the latter most potent in amplifying T cell responses. DC 
interaction via CD40 with T cells also stimulates the production of inflammatory cytokines such as 
TNF alpha and 1H. Engagement of RANK, a member of the TNF receptor family by its ligand, 

25 TRANCE, which is expressed on activated T cells, enhances the survival of DC through inhibition of 
apoptosis, thereby enhancing T cell activation. The maturation and differentiation of monocytes into 
mature DC links the antigen non-specific innate immune response to the antigen-specific adaptive 
immune response. 

The process by which monocytes differentiate into immature dendritic cells in vivo has not 
30 been fully elucidated. Incubation of monocytes with granulocyte-macrophage colony stimulating 

factor (GM-CSF) and interleukin (IL) -4 in vitro yields cells that exhibit functional and morphological 
characteristics equivalent to immature dendritic cells found in vivo. Moreover, incubation in vitro of 
immature dendritic cells with tumor necrosis factor alpha (TNF-a), CD40 ligand, LPS, or monocyte- 
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conditioned medium yields mature dendritic cells that are potent activators of naive T cells. 

The ability to manipulate DC in vitro and their capacity to mount an effective immune 
response with small numbers of DC and little antigen has led to potential immunotherapies for diseases 
such as cancer, AIDS, and infectious diseases; and enhancing vaccine efficacy. Spontaneous 

5 remissions of particular cancers such as renal cell carcinomas and melanomas indicate that the 
immune system can respond to tumor antigens and eliminate tumors. However, tumors escape 
immune surveillance through a number of means including secretion of 1L-10, macrophage colony 
stimulating factor, IL-6, and vascular endothelial growth factor, all of which inhibit DC activity and 
promote tolerance of tumor tissue. Delivery of tumor antigen-loaded DC to tumors can induce tumor- 

10 specific rejection in animal models. Similarly, pathogens can escape immune surveillance by altering 
antigen processing and presentation pathways or interfering with maturation of antigen presenting 
cells. Rather than providing resistance, DC can complicate infection by hosting latent viruses such as 
Kaposi's virus and cytomegalovirus, complicating infection. H1V-1 and measles virus particles are 
efficiently produced in DC. Vaccines against tumors or infectious pathogens could be improved by 

15 systemic or local administration of DC loaded with tumor antigens or attenuated viral particles or 
components, respectively. 

The expression of killer-inhibitor regulatory molecules, chemokines, chemokine receptors, and 
proteinases have been identified in DC through sequencing of ESTs. Continuing this search may 
reveal new lymphocyte-binding and antigen-processing molecules, transmembrane and secretory 

20 products, and transcription factors that may help to explain the specialized features of DC and allow 
manipulation of the immune system. 
Alzheimer's Disease 

Characterization of region-specific gene expression in the human brain provides a context and 
background for molecular neurobiology on a variety of neurological disorders. For example, 

25 Alzheimer's disease (AD) is a progressive, neurodestructive process of the human neocortex, 
characterized by the deterioration of memory and higher cognitive function. A progressive and 
irreversible brain disorder, AD is characterized by three major pathogenic episodes involving (a) an 
aberrant processing and deposition of beta-amyloid precursor protein (betaAPP) to form neurotoxic 
beta-amyloid (beta A) peptides and an aggregated insoluble polymer of beta A that forms the senile 

30 plaque, (b) the establishment of intraneuronal neuritic tau pathology yielding widespread deposits of 
agyrophilic neurofibrillary tangles (NFT) and (c) the initiation and proliferation of a brain-specific 
inflammatory response. These three seemingly disperse attributes of AD etiopathogenesis are linked 
by the fact that proinflammatory microglia, reactive astrocytes and their associated cytokines and 
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chemokines are associated with the biology of the microtubule associated protein tau, betaA speciation 
and aggregation. Missense mutations in the presenilin genes PS1 and PS2, implicated in early onset 
familial AD, cause abnormal betaAPP processing with resultant overproduction of betaA42 and 
related neurotoxic peptides. Specific betaA fragments such as betaA42 can further potentiate 

5 proinflammatory mechanisms. Expression of the inducible oxidoreductase cyclooxygenase-2 and 
cytosolic phospholipase A2 (cPLA2) is strongly activated during cerebral ischemia and trauma, 
epilepsy and AD, indicating the induction of proinflammatory gene pathways as a response to brain 
injury. Neurotoxic metals such as aluminum and zinc, both implicated in AD etiopathogenesis, and 
arachidonic acid, a major metabolite of brain cPLA2 activity, each polymerize hyperphosphorylated 

10 tau to form NFT-like bundles. Studies have identified a reduced risk for AD in patients aged over 70 
years who were previously treated with non-steroidal anti-inflammatory drugs for non-CNS afflictions 
that include arthritis. (For a review of the interrelationships between the mechanisms of PS1, PS2 and 
betaAPP gene expression, tau and betaA deposition and the induction, regulation and proliferation in 
AD of the neuroinflammatory response, see Lukiw, WJ, and Bazan, N.G. (2000) Neurochem. Res. 

15 2000 25:1173-1184). 

There is a need in the art for new compositions, including nucleic acids and proteins, for the 
diagnosis, prevention, and treatment of cell proliferative, autoimmune/inflammatory, cardiovascular, 
neurological, and developmental disorders. 

20 SUMMARY OF THE INVENTION 

Various embodiments of the invention provide purified polypeptides, secreted proteins, 
referred to collectively as 'SECF and individually as 4 SECP-1/ 4 SECP-2/ 4 SECP-3,* 4 SECP-4/ 
4 SECP-5,' 'SECP-6/ 4 SECP-7/ 'SECP-8/ *SECP 9/ •SECP-10/ 4 SECP-1V 'SECP-12/ 4 SECP- 
13/ -SECP-14/ 'SECP-15,* 4 SECP-16,* 4 SECP-17,' 'SECP-18/ 'SECP-19/ 'SECP-20,* 'SECP-21,' 

25 4 SECP-22,* -SECP-23/ 4 SECP-24,' 'SECP-25/ 'SECP-26,' 4 SECP-27,' 'SECP-28,' 4 SECP-29/ 
•SECP-30,* 'SECP-31,' 'SECP-32/ 4 SECP-33/ 4 SECP-34,' 4 SECP-35,* 4 SECP-36,' 'SECP-37/ 
4 SECP-38/ 'SECP-39,* 'SECP-40/ 'SECP^!/ 4 SECP-42/ 4 SECP-43/ 4 SECP-44/ 4 SECP-45/ 
4 SECP-46/ 4 SECP-47/ 'SECP-48/ 4 SECP-49,* 4 SECP-50/ and 4 SECP-51* and methods for using 
these proteins and their encoding polynucleotides for the detection, diagnosis, and treatment of 

30 diseases and medical conditions. Embodiments also provide methods for utilizing the purified secreted 
proteins and/or their encoding polynucleotides for facilitating the drug discovery process, including 
determination of efficacy, dosage, toxicity, and pharmacology. Related embodiments provide methods 
for utilizing the purified secreted proteins and/or their encoding polynucleotides for investigating the 
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pathogenesis of diseases and medical conditions. 

An embodiment provides an isolated polypeptide selected from the group consisting of a) a 
polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
51, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical or at 

5 least about 90% identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-51, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-51, and d) an immunogenic fragment of a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO:l-51. Another 
embodiment provides an isolated polypeptide comprising an amino acid sequence of SEQ ID NO: 1-51. 

10 Still another embodiment provides an isolated polynucleotide encoding a polypeptide selected 

from the group consisting of a) a polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-5 1, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical or at least about 90% identical to an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-51, c) a biologically active fragment of a polypeptide 

1 5 having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-5 1 , and d) an 
immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-51. In another embodiment, the polynucleotide encodes a polypeptide 
selected from the group consisting of SEQ ID NO.1-51. In an alternative embodiment, the 
polynucleotide is selected from the group consisting of SEQ ID N 0:52- 102. 

20 Still another embodiment provides a recombinant polynucleotide comprising a promoter 

sequence operably linked to a polynucleotide encoding a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from die group consisting 
of SEQ ID NO:l-51, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
90% identical or at least about 90% identical to an amino acid sequence selected from the group 

25 consisting of SEQ ID NO: 1 -5 1 , c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-51, and d) an immunogenic fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-51. 
Another embodiment provides a cell transformed with the recombinant polynucleotide. Yet another 
embodiment provides a transgenic organism comprising the recombinant polynucleotide. 

30 Another embodiment provides a method for producing a polypeptide selected from the group 

consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting 
of SEQ ID NO:l-51, b) a polypeptide comprising a naturally occurring amino acid sequence at least 
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90% identical or at least about 90% identical to an amino acid sequence selected from die group 
consisting of SEQ ID NO:l-5 1, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-51, and d) an immunogenic fragment of 
a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-51. 
5 The method comprises a) culturing a cell under conditions suitable for expression of the polypeptide, 
wherein said cell is transformed with a recombinant polynucleotide comprising a promoter sequence 
operably linked to a polynucleotide encoding the polypeptide, and b) recovering the polypeptide so 
expressed. 

Yet another embodiment provides an isolated antibody which specifically binds to a 

10 polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from die group consisting of SEQ ID NO:l-51, b) a polypeptide comprising a naturally 
occurring amino acid sequence at least 90% identical or at least about 90% identical to an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-51, c) a biologically active fragment of 
a polypeptide having an amino acid sequence selected from die group consisting of SEQ ID NO: 1-51, 

15 and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-5 1 . 

Still yet another embodiment provides an isolated polynucleotide selected from die group 
consisting of a) a polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:52-102, b) a polynucleotide comprising a naturally occurring polynucleotide 

20 sequence at least 90% identical or at least about 90% identical to a polynucleotide sequence selected 
from die group consisting of SEQ ID NO32-102, c) a polynucleotide complementary to die 
polynucleotide of a), d) a polynucleotide complementary to die polynucleotide of b), and e) an RNA 
equivalent of a>d). In other embodiments, die polynucleotide can comprise at least about 20, 30, 40, 
60, 80, or 100 contiguous nucleotides. 

25 Yet another embodiment provides a method for detecting a target polynucleotide in a sample, 

said target polynucleotide being selected from the group consisting of a) a polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:52-102, b) a polynucleotide 
comprising a naturally occurring polynucleotide sequence at least 90% identical or at least about 90% 
identical to a polynucleotide sequence selected from die group consisting of SEQ ID NO:52-102, c) a 

30 polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a>d). The method comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 
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complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 
said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex. In a related embodiment, die method can include detecting the amount of 

5 the hybridization complex. In still other embodiments, die probe can comprise at least about 20, 30, 
40, 60, 80, or 100 contiguous nucleotides. 

Still yet another embodiment provides a method for detecting a target polynucleotide in a 
sample, said target polynucleotide being selected from the group consisting of a) a polynucleotide 
comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:52-102, b) a 

10 polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NCh52-102, c) a polynucleotide complementary to the polynucleotide of a), d) a polynucleotide 
complementary to the polynucleotide of b), and e) an RNA equivalent of a)-d). The method 
comprises a) amplifying said target polynucleotide or fragment thereof using polymerase chain 

15 reaction amplification, and b) detecting die presence or absence of said amplified target polynucleotide 
or fragment thereof. In a related embodiment, die method can include detecting die amount of the 
amplified target polynucleotide or fragment thereof. 

Another embodiment provides a composition comprising an effective amount of a polypeptide 
selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected 

20 from the group consisting of SEQ ID NO: 1-5 1 , b) a polypeptide comprising a naturally occurring 
amino acid sequence at least 90% identical or at least about 90% identical to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-51, c) a biologically active fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-51, 
and d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from die 

25 group consisting of SEQ ID NO:l-51, and a pharmaceutically acceptable excipient In one 

embodiment, the composition can comprise an amino acid sequence selected from the group consisting 
of SEQ ID NO:l-51. Other embodiments provide a method of treating a disease or condition 
associated with decreased or abnormal expression of functional SECP, comprising administering to a 
patient in need of such treatment the composition. 

30 Yet another embodiment provides a method for screening a compound for effectiveness as an 

agonist of a polypeptide selected from die group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-51, b) a polypeptide comprising a 
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naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-51, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:l-5 1, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 

5 selected from die group consisting of SEQ ID NCfcl-51. The method comprises a) exposing a sample 
comprising the polypeptide to a compound, and b) detecting agonist activity in the sample. Another 
embodiment provides a composition comprising an agonist compound identified by the method and a 
pharmaceutical^ acceptable excipient Yet another embodiment provides a method of treating a 
disease or condition associated with decreased expression of functional SECP, comprising 

10 administering to a patient in need of such treatment the composition. 

Still yet another embodiment provides a method for screening a compound for effectiveness 
as an antagonist of a polypeptide selected from die group consisting of a) a polypeptide comprising an 
amino add sequence selected from die group consisting of SEQ ID NO: 1-51, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical or at least about 90% 

15 identical to an amino acid sequence selected from die group consisting of SEQ ID NO:l-51, c) a 
biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-51, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-51. The method comprises a) 
exposing a sample comprising the polypeptide to a compound, and b) detecting antagonist activity in 

20 die sample. Another embodiment provides a composition comprising an antagonist compound 
identified by die method and a pharmaceutical^ acceptable excipient Yet another embodiment 
provides a method of treating a disease or condition associated with overexpression of functional 
SECP, comprising administering to a patient in need of such treatment the composition. 

Another embodiment provides a method of screening for a compound that specifically binds to 

25 a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-51, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-51, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 

30 ID NO:l-51, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-51. The method comprises a) combining die 
polypeptide with at least one test compound under suitable conditions, and b) detecting binding of the 
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polypeptide to the test compound, thereby identifying a compound that specifically binds to die 
polypeptide. 

Yet another embodiment provides a method of screening for a compound that modulates die 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 

5 acid sequence selected from the group consisting of SEQ ID NO:l-51, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical or at least about 90% identical to an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-51, c) a biologically active 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ 
ID NO:l-51, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 

10 selected from the group consisting of SEQ ID NO:l-5 1. The method comprises a) combining the 
polypeptide with at least one test compound under conditions permissive for die activity of the 
polypeptide, b) assessing die activity of the polypeptide in the presence of the test compound, and c) 
comparing the activity of die polypeptide in the presence of the test compound with die activity of the 
polypeptide in the absence of die test compound, wherein a change in the activity of die polypeptide in 

15 the presence of die test compound is indicative of a compound that modulates the activity of die 
polypeptide. 

Still yet another embodiment provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
polynucleotide sequence selected from die group consisting of SEQ ID NO:52-102, die method 
20 comprising a) exposing a sample comprising the target polynucleotide to a compound, b) detecting 
altered expression of the target polynucleotide, and c) comparing die expression of the target 
polynucleotide in the presence of varying amounts of die compound and in the absence of die 
compound. 

Another embodiment provides a method for assessing toxicity of a test compound, said 
25 method comprising a) treating a biological sample containing nucleic acids with the test compound; b) 
hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from die group consisting of i) a polynucleotide 
comprising a polynucleotide sequence selected from die group consisting of SEQ ID NOi52-102, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical or at 
30 least about 90% identical to a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:52-102, iii) a polynucleotide having a sequence complementary to i), iv) a polynucleotide 
complementary to die polynucleotide of ii), and v) an RNA equivalent of i>i v). Hybridization occurs 
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under conditions whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide selected from the group consisting of 
i) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:52-102, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 

5 90% identical or at least about 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:52- 102, iii) a polynucleotide complementary to the polynucleotide of i), iv) a 
polynucleotide complementary to the polynucleotide of ii), and v) an RNA equivalent of i)-iv). 
Alternatively, the target polynucleotide can comprise a fragment of a polynucleotide selected from the 
group consisting of i>v) above; c) quantifying the amount of hybridization complex; and d) comparing 

10 the amount of hybridization complex in the treated biological sample with the amount of hybridization 
complex in an untreated biological sample, wherein a difference in die amount of hybridization 
complex in die treated biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

15 Table 1 summarizes the nomenclature for full length polynucleotide and polypeptide 

embodiments of the invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog, and the PROTEOME database identification numbers and annotations of PROTEOME 
database homologs, for polypeptides of the invention. The probability scores for the matches between 
20 each polypeptide and its homolog(s) are also shown. 

Table 3 shows structural features of polypeptide embodiments, including predicted motifs and 
domains, along with the methods, algorithms, and searchable databases used for analysis of the 
polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
25 polynucleotide embodiments, along with selected fragments of the polynucleotides. 

Table 5 shows representative cDNA libraries for polynucleotide embodiments. 
Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze polynucleotides and 
30 polypeptides, along with applicable descriptions, references, and threshold parameters. 

Table 8 shows single nucleotide polymorphisms found in polynucleotide sequences of the 
invention, along with allele frequencies in different human populations. 
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DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleic acids, and methods are described, it is understood that 
embodiments of the invention are not limited to the particular machines, instruments, materials, and 
5 methods described, as these may vary. It is also to be understood that the terminology used herein is 
for die purpose of describing particular embodiments only, and is not intended to limit the scope of the 
invention. 

As used herein and in die appended claims, the singular forms "a," "an," and "the" include 
plural reference unless the context clearly dictates otherwise. Thus, for example, a reference to "a 

10 host celT includes a plurality of such host cells, and a reference to "an antibody is a reference to one 
or more antibodies and equivalents thereof known to those skilled in die art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in die art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 

15 used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing die 
cell lines, protocols, reagents and vectors which are reported in the publications and which might be 
used in connection with various embodiments of the invention. Nothing herein is to be construed as an 
admission that the invention is not entitled to antedate such disclosure by virtue of prior invention. 

20 DEFINITIONS 

"SECP" refers to die amino acid sequences of substantially purified SECP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and human, 
and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
25 SECP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of SECP either by directly interacting with 
SECP or by acting on components of die biological pathway in which SECP participates. 

An "allelic variant" is an alternative form of the gene encoding SECP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
30 polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
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Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding SECP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as SECP or a 

5 polypeptide with at least one functional characteristic of SECP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding SECP, and improper or unexpected hybridization to allelic variants, with a 
locus other than die normal chromosomal locus for the polynucleotide encoding SECP. The encoded 
protein may also be "altered," and may contain deletions, insertions, or substitutions of amino acid 

10 residues which produce a silent change and result in a functionally equivalent SECP. Deliberate 
amino acid substitutions may be made on the basis of one or more similarities in polarity, charge, 
solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of die residues, as long as die 
biological or immunological activity of SECP is retained. For example, negatively charged amino acids 
may include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 

IS argjnine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 

include: asparagine and ghitamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
aid phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence** can refer to an oligopeptide, a peptide, a 

20 polypeptide, or a protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid. Amplification 

25 may be carried out using polymerase chain reaction (PGR) technologies or other nucleic acid 
amplification technologies well known in die art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of SECP. Antagonists may include proteins such as antibodies, anticalins, nucleic acids, 
carbohydrates, small molecules, or any other compound or composition which modulates the activity of 

30 SECP either by directly interacting with SECP or by acting on components of the biological pathway 
in which SECP participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
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thereof, such as Fab, FCaV)^ and Fv fragments, which are capable of binding an epitopic determinant 
Antibodies that bind SECP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
5 or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

10 immunize a host animal, numerous regions of die protein may induce die production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures on 
die protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used 
to elicit the immune response) for binding to an antibody. 

The term "aptamer" refers to a nucleic acid or oligonucleotide molecule that binds to a 

IS specific molecular target Aptamers are derived from an in vitro evolutionary process (e.g., SELEX* : 
(Systematic Evolution of Ligands by Exponential Enrichment), described in U.S. Patent No. 
5,270,163), which selects for target-specific aptamer sequences from large combinatorial libraries. 
Aptamer compositions may be double-stranded or single-stranded, and may include 
deoxyribonuckotides, ribonucleotides, nucleotide derivatives, or other nucleotide-like molecules. The 

20 nucleotide components of an aptamer may have modified sugar groups (e.g., the 2 -OH group of a 
ribonucleotide may be replaced by 2'-F or 2 -NH,), which may improve a desired property, e.g., 
resistance to nucleases or longer lifetime in blood. Aptamers may be conjugated to other molecules, 
e.g., a high molecular weight carrier to slow clearance of the aptamer from die circulatory system. 
Aptamers may be specifically cross-linked to their cognate ligands, e.g., by photo-activation of a 

25 cross-linker (Brody, E.N. and L. Gold (2000) J. Biotechnol. 74:5-13). 

The term "intramer" refers to an aptamer which is expressed in vivo. For example, a 
vaccinia virus-based RNA expression system has been used to express specific RNA aptamers at 
high levels in the cytoplasm of leukocytes (Blind, M. etaL (1999) Proc. Natl. Acad. Sci. USA 
96:3606-3610). 

30 The term "spiegelmer" refers to an aptamer which includes L-DNA, L-RNA, or other left- 

handed nucleotide derivatives or nucleotide-like molecules. Aptamers containing left-handed 
nucleotides are resistant to degradation by naturally occurring enzymes, which normally act on 
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substrates containing right-handed nucleotides. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a polynucleotide having a specific nucleic acid sequence. Antisense compositions 
may include DNA; RNA; peptide nucleic acid (PNA); oligonucleotides having modified backbone 

5 linkages such as phosphorothioates, mcthylphosphonates, or benzylphosphonates ; oligonucleotides 
having modified sugar groups such as 2 -methoxyethyl sugars or 2-methoxyethoxy sugars; or 
oligonucleotides having modified bases such as 5-methyl cytosine, 2 , -deoxyuracil, or 7-deaza-2 - 
deoxyguanosine. Antisense molecules may be produced by any method including chemical synthesis 
or transcription. Once introduced into a cell, the complementary antisense molecule base-pairs with a 

10 naturally occurring nucleic acid sequence produced by die cell to form duplexes which block either 
transcription or translation. The designation "negative" or "minus" can refer to the antisense strand, 
and the designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule, likewise, "immunologically active" or "immunogenic" 

15 refers to the capability of the natural, recombinant, or synthetic SECP, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes die relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3' pairs with its complement, 
20 3'-TCA-5\ 

A "composition comprising a given polynucleotide" and a "composition comprising a given 
polypeptide" can refer to any composition containing the given polynucleotide or polypeptide. The 
composition may comprise a dry formulation or an aqueous solution. Compositions comprising 
polynucleotides encoding SECP or fragments of SECP may be employed as hybridization probes. The 

25 probes maybe stored in freeze-dried form and maybe associated with a stabilizing agent such as a 
carbohydrate. In hybridizations, the probe may be deployed in an aqueous solution containing salts 
(e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's 
solution, dry milk, salmon sperm DNA, etc.)* 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to 

30 repeated DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied 
Biosystems, Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from one or more overlapping cDNA, EST, or genomic DNA fragments using a computer 
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program for fragment assembly, such as the GELVlJbW fragment assembly system (Accelrys, 
Burlington MA) or Phrap (University of Washington, Seattle WA). Some sequences have been both 
extended and assembled to produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 
5 interfere with the properties of the original protein, Le., the structure and especially die function of die 
protein is conserved and not significantly changed by such substitutions. Hie table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 


10 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


15 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, Us 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




Be 


Leu, Val 


20 


Leu 


lie, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, Be 




Phe 


His, Met, Leu, Tip, Tyr 




Ser 


Cys, Thr 


25 


Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Tip 




Val 


De, Leu, Thr 



30 Conservative amino acid substitutions generally maintain (a) die structure of die polypeptide 

backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobici ty of the molecule at die site of the substitution, and/or (c) the bulk of 
the side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
35 absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide can include, for example, replacement of hydrogen by an 
alky], acyl, hydroxy!, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
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one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 
5 "Differential expression" refers to increased or upregulated; or decreased, downregulated, or 

absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a 
diseased and a normal sample. 

"Exon shuffling" refers to the recombination of different coding regions (exons). Since an 

10 exon may represent a structural or functional domain of the encoded protein, new proteins may be 
assembled through the novel reassortment of stable substructures, thus allowing acceleration of the 
evolution of new protein functions. 

A "fragment" is a unique portion of SECP or a polynucleotide encoding SECP which can be 
identical in sequence to, but shorter in length than, the parent sequence. A fragment may comprise up 

15 to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from about 5 to about 1000 contiguous nucleotides or amino acid residues. A 
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 
5, 10, 15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino 
acid residues in length. Fragments may be preferentially selected from certain regions of a molecule. 

20 For example, a polypeptide fragment may comprise a certain length of contiguous amino acids 

selected from die first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a 
certain defined sequence. Clearly these lengths are exemplary, and any length that is supported by die 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

25 A fragment of SEQ ID NO:52-102 can comprise a region of unique polynucleotide sequence 

that specifically identifies SEQ ID NO32-102, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:52-102 can be employed 
in one or more embodiments of methods of the invention, for example, in hybridization and 
amplification technologies and in analogous methods that distinguish SEQ ID NO:52-102 from related 

30 polynucleotides. The precise length of a fragment of SEQ ID NO:52-102 and the region of SEQ ID 
NO:52-102 to which the fragment corresponds are routinely determinable by one of ordinary skill in 
the art based on the intended purpose for die fragment. 
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A fragment of SEQ ID NO:l-51 is encoded by a fragment of SEQ ID NO:52-102. A 
fragment of SEQ ID NO: 1-5 1 can comprise a region of unique amino acid sequence that specifically 
identifies SEQ ID NO:l-51. For example, a fragment of SEQ ID NO:l-51 can be used as an 
immunogenic peptide for the development of antibodies that specifically recognize SEQ ID NO: 1-51. 
5 The precise length of a fragment of SEQ ID NO.1-51 and the region of SEQ ID NO:l-51 to which 
the fragment corresponds can be determined based on the intended purpose for the fragment using 
one or more analytical methods described herein or otherwise known in the art 

A "full length" polynucleotide is one containing at least a translation initiation codon (e.g., 
methionine) followed by an open reading frame and a translation termination codon. A "full length" 
10 polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, alternatively, sequence identity, between two or 
more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of identical nucleotide matches between at least two polynucleotide sequences aligned 
15 using a standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, 
gaps in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using one or more 
computer algorithms or programs known in the art or described herein. For example, percent identity 
20 can be determined using the default parameters of the GLUSTAL V algorithm as incorporated into 
the MEGALIGN version 3.12e sequence alignment program. This program is part of the 
LASERGENE software package, a suite of molecular biological analysis programs (DNASTAR, 
Madison WI). CLUSTAL V is described in Kggins, D.G. and P.M. Sharp (1989; CABIOS 5:151- 
153) and in Kggins, D.G. et al. (1992; CABIOS 8:189-191). For pairwise alignments of 
25 polynucleotide sequences, the default parameters are set as follows: Ktuple=2, gap penalty=5, 

window=4, and "diagonals saved"=4. The "weighted" residue weight table is selected as the default 
Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
which can be used is provided by the National Center for Biotechnology Information (NCBI) Basic 
Local Alignment Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which 
30 is available from several sources, including die NCBI, Bethesda, MD, and on the Internet at 

httpyAvww.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
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polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences'' that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http7Avww.ncbi.nlm.nih.gov/gorf /bChtmL The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
5 programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 
10 Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 
15 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ED number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
feast 20, at least 30, at least 40, at least SO, at least 70, at least 100, or at least 200 contiguous 

20 nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in die tables, figures, or Sequence listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to die degeneracy of the genetic code. It is understood that changes 

25 in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of identical residue matches between at least two polypeptide sequences aligned using 
a standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 

30 alignment methods take into account conservative amino acid substitutions. Such conservative 

substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving die structure (and therefore function) of the polypeptide. The 
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phrases "percent similarity* and "% similarity," as applied to polypeptide sequences, refer to the 
percentage of residue matches, including identical residue matches and conservative substitutions, 
between at least two polypeptide sequences aligned using a standardized algorithm. In contrast, 
conservative substitutions are not included in the calculation of percent identity between polypeptide 
5 sequences. 

. Percent identity between polypeptide sequences may be determined using die default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3. 12e 
sequence alignment program (described and referenced above). For pairwise alignments of 
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l , gap 
10 penahy=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0. 12 ( April-2 1 -2000) with blastp set at default parameters. Such default parameters may be, for 
15 example: 

Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
Gap x drop-off: 50 
Expect: 10 
20 Word Size: 3 

Filter: on 

Percent identity may be measured over die length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, 
for example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
25 instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least SO, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
30 DNA sequences of about 6 kb to 10 Mb in size and which contain all of die elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which die amino acid 
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sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
5 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of die hybridization process, with more stringent conditions allowing less non-specific 
binding, Le., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 

10 conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in 
die art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve die desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 pglml sheared, denatured salmon sperm DNA. 

15 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (TJ for the specific sequence at a defined ionic 
strength and pR The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 

20 conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. and D.W. 
Russell (2001; Molecular Cloning: A Laboratory Manual, 3rd ed., vol 1-3, Cold Spring Harbor Press, 
Cold Spring Harbor NY, ch. 9). 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 

25 Alternatively, temperatures of about 65°C 60°C 55 °C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 /ig/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 

30 such as for RNAiDNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
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strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acids by 
virtue of the formation of hydrogen bonds between complementary bases. A hybridization complex 
may be formed in solution (e.g., or R<J analysis) or formed between one nucleic acid present in 
5 solution and another nucleic acid immobilized on a solid support (e.g., paper, membranes, filters, chips, 
pins or glass slides, or any other appropriate substrate to which cells or their nucleic acids have been 
fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or polynucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

10 "Immune response" can refer to conditions associated with inflammation, trauma, immune 

disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of SECP which is 

15 capable of eliciting an immune response when introduced into a living organism, for example, a 

mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
of SECP which is useful in any of the antibody production methods disclosed herein or known in die 
art 

Hie term "micro array" refers to an arrangement of a plurality of polynucleotides, 
20 polypeptides, antibodies, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, antibody, or 
other chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in die activity of SECP. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
25 functional, or immunological properties of SECP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DN A or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent die sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-Hke material. 
30 "Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 

functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if die promoter affects die transcription or expression of the coding 
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sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 

5 amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification*' of an SECP may involve Kpidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 

10 ait These processes may occur synthetically or biochemically. Biochemical modifications will vary 
by cell type depending on the enzymatic milieu of SECP. 

"Probe" refers to nucleic adds encoding SECP, their complements, or fragments thereof, 
which are used to detect identical, allelic or related nucleic acids. Probes are isolated oligonucleotides 
or polynucleotides attached to a detectable label or reporter molecule. Typical labels include 

15 radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are short nucleic 
acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a 
DNA polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic 
acid, e.g. , by the polymerase chain reaction (PCR). 

20 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 

25 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in, for example, Sambrook, 
J. and D.W. Russell (2001; Molecular Cloning: A Laboratory Manual , 3rd ed., vol 1-3, Cold Spring 
Harbor Press, Cold Spring Harbor NY), Ausubel, RM. et al. (1999; Short Protocols in Molecular 
Biology . 4* ed., John Wiley & Sons, New York NY), and Innis, M. et al. (1990; PCR Protocols. A 

30 Guide to Methods and Applications . Academic Press. San Diego CA). PCR primer pairs can be 

derived from a known sequence, for example, by using computer programs intended for that purpose 
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA). 
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Oligonucleotides for use as primers are selected using software known in die art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
5 programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas 
South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome- wide scope. The Primer3 primer 
selection program (available to die public from the Whitehead Institute/MIT Center for Genome 

10 Research, Cambridge MA) allows the user to input a "misprinting library,** in which sequences to 
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 

15 Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to either die most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 

20 as PCR or sequencing primers, micro array elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are 
not limited to those described above. 

A "recombinant nucleic acid" is a nucleic acid that is not naturally occurring or has a 
sequence that is made by an artificial combination of two or more otherwise separated segments of 

25 sequence. This artificial combination is often accomplished by chemical synthesis or, more commonly, 
by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering 
techniques such as those described in Sambrook and Russell (supra). The term recombinant includes 
nucleic acids that have been altered solely by addition, substitution, or deletion of a portion of the 
nucleic acid. Frequently, a recombinant nucleic acid may include a nucleic acid sequence operably 

30 linked to a promoter sequence. Such a recombinant nucleic acid may be part of a vector that is used, 
for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
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vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
5 (UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RN A stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cof actors; inhibitors; magnetic particles; and 
10 other moieties known in the art. 

An "RNA equivalent," in reference to a DNA molecule, is composed of the same linear 
sequence of nucleotides as die reference DNA molecule with the exception that all occurrences of 
the nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

15 The term "sample" is used in its broadest sense. A sample suspected of containing SEGP, 

nucleic acids encoding SECP, or fragments thereof may comprise a bodily fluid; an extract from a cell, 
chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, in 
solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 

20 protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 

synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by die binding molecule. For 
example, if an antibody is specific for epitope "A ," the presence of a polypeptide comprising the 
epitope A, or die presence of free unlabeled A, in a reaction containing free labeled A and the 

25 antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably at least about 75% free, and most preferably at least about 90% free from other 
components with which they are naturally associated. 

30 A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 

by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
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chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
micropaiticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A 'transcript image" or "expression profile" refers to the collective pattern of gene expression 

5 by a particular cell type or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the ait, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell The method for transformation is selected based 

10 on the type of host cell being transformed and may include, but is not limited to, bacteriophage or viral 
infection, electroporation, heat shock, lipofection, and particle bombardment The term "transformed 
cells" includes stably transformed cells in which die inserted DNA is capable of replication either as 
an autonomously replicating plasmid or as part of the host chromosome, as well as transiently 
transformed cells which express the inserted DNA or RN A for limited periods of time. 

IS A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of die organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art The 
nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of the cell, 
by way of deliberate genetic manipulation, such as by microinjection or by infection with a 

20 recombinant virus. In another embodiment, the nucleic acid can be introduced by infection with a 
recombinant viral vector, such as a lenti viral vector (Lois, C. et al. (2002) Science 295:868-872). The 
term genetic manipulation does not include classical cross-breeding, or in vitro fertilization, but rather 
is directed to the introduction of a recombinant DNA molecule. The transgenic organisms 
contemplated in accordance with die present invention include bacteria, cyanobacteria, fungi, plants 

25 and animals. The isolated DNA of the present invention can be introduced into the host by methods 
known in die art, for example infection, transection, transformation or transconjugation. Techniques 
for transferring the DNA of the present invention into such organisms are widely known and provided 
in references such as Sambrook and Russell (supra). 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 

30 at least 40% sequence identity to die particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
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least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 
93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater 
sequence identity over a certain defined length. A variant may be described as, for example, an 
"allelic" (as defined above), "splice " "species," or "polymorphic" variant A splice variant may have 

5 significant identity to a reference molecule, but win generally have a greater or lesser number of 
polynucleotides due to alternate splicing during mRNA processing. The corresponding polypeptide 
may possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotides that vary from one species to another. The resulting polypeptides 
will generally have significant amino acid identity relative to each other. A polymorphic variant is a 

10 variation in the polynucleotide sequence of a particular gene between individuals of a given species. 
Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in which the 
polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be indicative of, 
for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 

15 at least 40% sequence identity or sequence similarity to the particular polypeptide sequence over a 
certain length of one of die polypeptide sequences using blastp with the "BLAST 2 Sequences" tool 
Version 2.0.9 (May-07-1999) set at default parameters. Such a pair of polypeptides may show, for 
example, at least 50%, at least 60%, at feast 70%, at feast 80%, at least 85%, at feast 90%, at least 
91%, at least 92%, at feast 93%, at least 94%, at feast 95%, at least 96%, at least 97%, at feast 98%, 

20 or at feast 99% or greater sequence identity or sequence similarity over a certain defined length of one 
of die polypeptides. 

THE INVENTION 

Various embodiments of die invention include new human secreted proteins (SECP), the 
25 polynucleotides encoding SECP, and the use of these compositions for the diagnosis, treatment, or 
prevention of ceU proliferative, autoimmune/inflammatory, cardiovascular, neurological, and 
developmental disorders. 

Table 1 summarizes die nomenclature for the full length polynucleotide and polypeptide 
embodiments of the invention. Each polynucleotide and its corresponding polypeptide are correlated to 
30 a single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is 
denoted by both a polypeptide sequence identification number (Polypeptide SEQ ED NO:) and an 
Incyte polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide 
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sequence is denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ED 
NO:) and an Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 
Column 6 shows the Incyte ID numbers of physical, fall length clones corresponding to the polypeptide 
and polynucleotide sequences of the invention. The fall length clones encode polypeptides which have 

5 at least 95% sequence identity to the polypeptide sequences shown in column 3. 

Table 2 shows sequences with homology to polypeptide embodiments of the invention as 
identified by BLAST analysis against the GenBank protein (genpept) database and the PROTEOME 
database. Columns 1 and 2 show die polypeptide sequence identification number (Polypeptide SEQ 
ID NO:) and the corresponding Incyte polypeptide sequence number (Incyte Polypeptide ID) for 

10 polypeptides of the invention. Column 3 shows the GenBank identification number (GenBank ID NO:) 
of die nearest GenBank homolog and the PROTEOME database identification numbers 
(PROTEOME ID NO:) of die nearest PROTEOME database homologs. Column 4 shows die 
probability scores for the matches between each polypeptide and its homolog(s). Column 5 shows die 
annotation of the GenBank and PROTEOME database homolog(s) along with relevant citations where 

15 applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns I and 

2 show die polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 

3 shows die number of amino acid residues in each polypeptide. Column 4 shows potential 

20 phosphorylation sites and glycosylation sites, as determined by the MOTIFS program of die GCG 
sequence analysis software package (Accehys, Burlington MA), as well as amino acid residues 
comprising signature sequences, domains, and motifs, including the locations of signal peptides (as 
indicated by "Signal Peptide" and/or "signal_cleavage"). Column 5 shows analytical methods for 
protein structure/function analysis and in some cases, searchable databases to which the analytical 

25 methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are secreted proteins. For example, SEQ ID NO:24 
is 100% identical, from residue Ml to residue Rl 10, to human MYG1 (probable metal-dependent 
hydrolase) homolog (GenBank ID g!0444289) as determined by the Basic Local Alignment Search 

30 Tool (BLAST). (See Table 2.) The BLAST probability score is 3.7e-58, which indicates the 

probability of obtaining die observed polypeptide sequence alignment by chance. Data from BLAST, 
analysis of the PRODOM database provides further corroborative evidence that SEQ ID NO:24 is a 
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secreted hydrolase. 

As another example, SEQ ID NO.46 is 70% identical, from residue G45 to residue P1317 and 
is 39% identical, from residue W13 to residue E351, to rat MEGF6 (GenBank ID g3449294) as 
determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 

5 probability scores are both 0.0, which indicate the probabilities of obtaining the observed polypeptide 
sequence alignments by chance. As determined by BLAST analysis using the PROTEOME 
database, SEQ ID NO:46 has homology to rat MEGF6 (multiple epidermal growth factor (EGF>Uke 
domains 6) which contains 30 epidermal growth factor-like motifs, is a putative secreted protein, and is 
predicted to bind calcium (PROTEOME ID: 662841 [EgfB). SEQ ID NO:46 also has homology to a 

10 human protein which contains five laminin epidermal growth factor (EGF>like and IS epidermal 
growth factor (EGF>like domains (PROTEOME ID: 716683(MEGF11). SEQ ID NO:46 also 
contains EGF-Bke domains and laminin-type EGF-Eke domains as determined by searching for 
statistically significant matches in die hidden Markov model (HMM>based PFAM and SMART 
databases of conserved protein families/domains. (See Table 3.) Data from BLIMPS, MOTIFS, and 

15 additional BLAST analyses against PRODOM and DOMO databases provide further corroborative 
evidence that SEQ ID NO:46 is a protein containing EGF-Eke domains. 

SEQ ID NO:l-23, SEQ ID NO:25-45, and SEQ ID NO:47-51 were analyzed and annotated in 
a similar manner. The algorithms and parameters for the analysis of SEQ ID NO: 1-51 are described 
in Table 7. 

20 As shown in Table 4, die full length polynucleotide embodiments were assembled using cDNA 

sequences or coding (exon) sequences derived from genomic DNA, or any combination of these two 
types of sequences. Column 1 lists the polynucleotide sequence identification number (Polynucleotide 
SEQ ID NO:), the corresponding Incyte polynucleotide consensus sequence number (Incyte ID) for 
each polynucleotide of the invention, and die length of each polynucleotide sequence in basepairs. 

25 Column 2 shows the nucleotide start (5') and stop (3') positions of the cDNA and/or genomic 
sequences used to assemble the foil length polynucleotide embodiments, and of fragments of the 
polynucleotides which are useful, for example, in hybridization or amplification technologies that 
identify SEQ ID NO:52-102 or that distinguish between SEQ ID NO52-102 and related 
polynucleotides. 

30 The polynucleotide fragments described in Column 2 of Table 4 may refer specifically, for 

example, to Incyte cDNAs derived from tissue-specific cDNA libraries or from pooled cDNA 
libraries. Alternatively, die polynucleotide fragments described in column 2 may refer to GenBank 
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cDNAs or ESTs which contributed to die assembly of die full length polynucleotides. In addition, the 
polynucleotide fragments described in column 2 may identify sequences derived from the ENSEMBL 
(The Sanger Centre, Cambridge, UK) database (ie. t those sequences including die designation 
"ENST"). Alternatively, the polynucleotide fragments described in column 2 may be derived from the 

5 NCBI RefSeq Nucleotide Sequence Records Database (Le. , those sequences inchiding the 

designation "NM" or "NT") or die NCBI RefSeq Protein Sequence Records (Le. , those sequences 
including the designation **NP"). Alternatively, the polynucleotide fragments described in column 2 
may refer to assemblages of both cDNA and Genscan-predicted exons brought together by an "exon 
stitching** algorithm. For example, a polynucleotide sequence identified as 

10 FLJOOOOOCJVtJf^YYYYYJV^N. represents a "stitched" sequence in which XXXXXX is die 

identification number of die cluster of sequences to which the algorithm was applied, and YYYYY is the 
number of die prediction generated by die algorithm, and N w _, if present, represent specific exons 
that may have been manually edited during analysis (See Example V). Alternatively, the 
polynucleotide fragments in column 2 may refer to assemblages of exons brought together by an 

IS "exon-stretching" algorithm, For example, a polynucleotide sequence identified as 

FLXXXXXX_gAAAAA_gBBBBBJi_N is a "stretched" sequence, with XXXXXX being the Incyte 
project identification number, gfiAAAA being the GenBank identification number of the human 
genomic sequence to which die "exon-stretching" algorithm was applied, gBBBBB being the GenBank 
identification number or NCBI RefSeq identification number of the nearest GenBank protein homolog, 

20 and N referring to specific exons (See Example V). In instances where a RefSeq sequence was used 
as a protein homolog for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM ," 
"NP," or "NT*) may be used in place of die GenBank identifier (Le., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 
genomic DNA sequences, or derived from a combination of sequence analysis methods. The 

25 following Table lists examples of component sequence prefixes and corresponding sequence analysis 
methods associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK) 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 
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INCY 



Full length transcript and exon prediction from mapping of EST 
sequences to die genome. Genomic location and EST composition 
data are combined to predict the exons and resulting transcript 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in 
Table 4 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte 

5 cDNA identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotides which 
were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte 
cDNA library which is most frequently represented by the Incyte cDNA sequences which were used 
to assemble and confirm the above polynucleotides. The tissues and vectors which were used to 

10 construct the cDNA libraries shown in Table 5 are described in Table 6. 

Table 8 shows single nucleotide polymorphisms (SNPs) found in polynucleotide sequences of 
the invention, along with allele frequencies in different human populations. Columns 1 and 2 show the 
polynucleotide sequence identification number (SEQ ID NO:) and die corresponding Incyte project 
identification number (PCD) for polynucleotides of the invention. Column 3 shows the Incyte 

15 identification number tor the EST in which the SNP was detected (EST ID), and column 4 shows the 
identification number for the SNP (SNP ID). Column 5 shows the position within the EST sequence 
at which the SNP is located (EST SNP), ami column 6 shows the position of the SNP within the full- 
length polynucleotide sequence (CB1 SNP). Column 7 shows the allele found in the EST sequence. 
Columns 8 and 9 show the two alleles found at the SNP ate. Column 10 shows the amino acid 

20 encoded by the codon including the SNP site, based upon die allele found in the EST. Columns 1 1-14 
show the frequency of allele 1 in four different human populations. An entry of n/d (not detected) 
indicates that the frequency of allele 1 in die population was too low to be detected, while n/a (not 
available) indicates that the allele frequency was not determined for die population. 

The invention also encompasses SECP variants. Various embodiments of SECP variants can 

25 have at least about 80%, at least about 90%, or at least about 95% amino acid sequence identity to the 
SECP amino acid sequence, and can contain at least one functional or structural characteristic of 
SECP. 

Various embodiments also encompass polynucleotides which encode SECP. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
30 from the group consisting of SEQ ID NO.S2-1Q2, which encodes SECP. The polynucleotide 
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sequences of SEQ ID NO:52-102, as presented in die Sequence listing, embrace the equivalent RNA 
sequences, wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the 
sugar backbone is composed of ribose instead of deoxyribose. 

The invention also encompasses variants of a polynucleotide encoding SECP. In particular, 

5 such a variant polynucleotide will have at least about 70%, or alternatively at least about 85%, or even 
at least about 95% polynucleotide sequence identity to a polynucleotide encoding SECP. A particular 
aspect of the invention encompasses a variant of a polynucleotide comprising a sequence selected 
from the group consisting of SEQ ID NO J2-102 which has at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to a nucleic acid 

10 sequence selected from the group consisting of SEQ ID NO:52-102. Any one of the polynucleotide 
variants described above can encode a polypeptide which contains at least one functional or structural 
characteristic of SECP. 

In addition, or in die alternative, a polynucleotide variant of the invention is a splice variant of a 
polynucleotide encoding SECP. A splice variant may have portions which have significant sequence 

15 identity to a polynucleotide encoding SECP, but will generally have a greater or lesser number of 
polynucleotides due to additions or deletions of blocks of sequence arising from alternate splicing 
during mRNA processing. A splice variant may have less than about 70%, or alternatively less than 
about 60%, or alternatively less than about 50% polynucleotide sequence identity to a polynucleotide 
encoding SECP over its entire length; however, portions of the splice variant will have at least about 

20 70%, or alternatively at least about 85%, or alternatively at least about 95%, or alternatively 100% 
polynucleotide sequence identity to portions of die polynucleotide encoding SECP. For example, a 
polynucleotide comprising a sequence of SEQ ID NO:52 and a polynucleotide comprising a sequence 
of SEQ ID NO:53 are splice variants of each other; a polynucleotide comprising a sequence of SEQ 
ID NO:54, a polynucleotide comprising a sequence of SEQ ID NO:55, a polynucleotide comprising a 

25 sequence of SEQ ID NO:56, and a polynucleotide comprising a sequence of SEQ ID NO:83 are splice 
variants of each other; a polynucleotide comprising a sequence of SEQ ID NO:63 and a 
polynucleotide comprising a sequence of SEQ ID NO:64 are splice variants of each other, a 
polynucleotide comprising a sequence of SEQ ID NO:66 and a polynucleotide comprising a sequence 
of SEQ ID NO:67 are splice variants of each other; and a polynucleotide comprising a sequence of 

30 SEQ ID NO:74 and a polynucleotide comprising a sequence of SEQ ID NO:75 are splice variants of 
each other; and a polynucleotide comprising a sequence of SEQ ID NO:96 and a polynucleotide 
comprising a sequence of SEQ ID NO:97 are splice variants of each other. Any one of the splice 
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variants described above can encode a polypeptide which contains at feast one functional or structural 
characteristic of SECP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding SECP, some bearing minimal similarity 

5 to the polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, 
the invention contemplates each and every possible variation of polynucleotide sequence that could be 
made by selecting combinations based on possible codon choices. These combinations are made in 
accordance with die standard triplet genetic code as applied to the polynucleotide sequence of 
naturally occurring SECP, and all such variations are to be considered as being specifically disclosed. 

1 0 Although polynucleotides which encode SECP and its variants are generally capable of 

hybridizing to polynucleotides encoding naturally occurring SECP under appropriately selected 
conditions of stringency, it may be advantageous to produce polynucleotides encoding SECT or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 
codons. Codons may be selected to increase the rate at which expression of the peptide occurs in a 

15 particular prokaryotic or eukaryotic host in accordance with the frequency with which particular 
codons are utilized by the host Otter reasons for substantially altering the nucleotide sequence 
encoding SECP and its derivatives without altering die encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from die naturally occurring sequence. 

20 The invention also encompasses production of polynucleotides which encode SECP and SECP 

derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
polynucleotide may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art Moreover, synthetic chemistry may be used to introduce 
mutations into a polynucleotide encoding SECP or any fragment thereof. 

25 Embodiments of die invention can also include polynucleotides that are capable of hybridizing 

to the claimed polynucleotides, and, in particular, to those having die sequences shown in SEQ ID 
NO:52-102 and fragments thereof, under various conditions of stringency (Wahl, G.M. and S.L. 
Berger (1987) Methods Enzymol. 152399-407; Kimmel, A.R. (1987) Methods Enzymol. 152:507-511: 
Hybridization conditions, including annealing and wash conditions, are described in 'Definitions/' 

30 Methods for DNA sequencing are well known in the art and may be used to practice any of 

the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
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Biosystems), thermostable T7 polymerase (Amersham Biosciences, Piscataway NJ), or combinations 
of polymerases and proofreading exonucteases such as those found in the ELONGASE amplification 
system (Invitrogen, Carlsbad CA). Preferably, sequence preparation is automated with machines 
such as the NDCROLAB 2200 liquid transfer system (Hamilton, Reno NV), PTC200 thermal cycler 

5 (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler (Applied Biosystems). 
Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing system (Applied 
Biosystems), the MEGABACE 1000 DNA sequencing system (Amersham Biosciences), or other 
systems known in die art The resulting sequences are analyzed using a variety of algorithms which 
are well known in the art (Ausubel et aL, supra, ch. 7; Meyers, RA. (1995) Molecular Biology and 

10 Biotechnology, Wiley VCH, New York NY, pp. 856-853). 

The nucleic acids encoding SECP may be extended utilizing a partial nucleotide sequence and 
employing various PCR-based methods known in die art to detect upstream sequences, such as 
promoters and regulatory elements. For example, one method which may be employed, restriction-site 
PCR, uses universal and nested primers to amplify unknown sequence from genomic DNA within a 

15 cloning vector (Sarkar, G. (1993) PCR Methods Applic. 2:318-322). Another method, inverse PCR, 
uses primers that extend in divergent directions to amplify unknown sequence from a circularized 
template. The template is derived from restriction fragments comprising a known genomic locus and 
surrounding sequences (TrigHa, T. et aL (1988) Nucleic Acids Res. 16:8186). A third method, capture 
PCR, involves PCR amplification of DNA fragments adjacent to known sequences in human and 

20 yeast artificial chromosome DNA (Lagerstrom, M. et aL (1991) PCR Methods Applic. 1:111-119). In 
this method, multiple restriction enzyme digestions and ligations may be used to insert an engineered 
double-stranded sequence into a region of unknown sequence before performing PCR. Other 
methods which may be used to retrieve unknown sequences are known in the art (Parker, J.D. et al. 
(1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, nested primers, and 

25 PROMOTERHNDER libraries (Oontech, Palo Alto CA) to walk genomic DNA. This procedure 
avoids the need to screen libraries and is useful in finding intron/exon junctions. For all PCR-based 
methods, primers maybe designed using commercially available software, such as OLIGO 4.06 
primer analysis software (National Biosciences, Plymouth MN) or another appropriate program, to be 
about 22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 

30 template at temperatures of about 68°C to 72°C 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
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sequences containing the 5 'regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5 " non-transcribed regulatory regions. 

Capillary electrophoresis systems winch are commercially available may be used to analyze 

5 die size or confirm the nucleotide sequence of sequencing or PGR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 

10 process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DN A fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotides or fragments thereof which encode 
SECP may be cloned in recombinant DN A molecules that direct expression of SECP, or fragments or . 

IS functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the genetic 
code, other polynucleotides which encode substantially the same or a functionally equivalent 
polypeptides may be produced and used to express SECP. 

The polynucleotides of the invention can be engineered using methods generally known in the 
art in order to alter SECP-encoding sequences for a variety of purposes including, but not limited to, 

20 modification of the cloning, processing, and/or expression of the gene product DNA shuffling by 
random fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may be 
used to engineer the nucleotide sequences. For example, ohgonuckotide-mediated site-directed 
mutagenesis may be used to introduce mutations that create new restriction sites, alter glycos ylation 
patterns, change codon preference, produce splice variants, and so forth. 

25 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent No. 
5,837,458; Chang, C.-C. et aL (1999) Nat BiotechnoL 17:793-797; Christians, F.C. et aL (1999) Nat 
BiotechnoL 17:259-264; and Crameri, A. et at (1996) Nat BiotechnoL 14:315-319) to alter or improve 
the biological properties of SECP, such as its biological or enzymatic activity or its ability to bind to 

30 other molecules or compounds. DNA shuffling is a process by which a library of gene variants is 
produced using PCR-mediated recombination of gene fragments. Hie library is then subjected to 
selection or screening procedures that identify those gene variants with the desired properties. These 
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preferred variants may then be pooled and further subjected to recursive rounds of DNA shuffling and 
selection/screening. Thus, genetic diversity is created through "artificial" breeding and rapid molecular 
evolution. For example, fragments of a single gene containing random point mutations may be 
recombined, screened, and then reshuffled until die desired properties are optimized. Alternatively, 

5 fragments of a given gene may be recombined with fragments of homologous genes in the same gene 
family, either from the same or different specks, thereby maximizing the genetic diversity of multiple 
naturally occurring genes in a directed and controllable manner. 

In another embodiment, polynucleotides encoding SECP may be synthesized, in whole or in 
part, using one or more chemical methods well known in the art (Caruthers, M.K et aL (1980) 

10 Nucleic Acids Symp. Ser. 7215-223; Horn, T. et aL (1980) Nucleic Acids Symp. Ser. 7225-232). 
Alternatively, SECP itself or a fragment thereof may be synthesized using chemical methods known in 
die art For example, peptide synthesis can be performed using various solution-phase or solid-phase 
techniques (Creighton, T. (1984) Proteins, Structures and Molecular Properties. WH Freeman, New 
Yolk NY, pp. 55-60; Roberge, J.Y. et al. (1995) Science 269202-204). Automated synthesis may be 

15 achieved using the ABI 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid 
sequence of SECP, or any part thereof, may be altered during direct synthesis and/or combined with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide 
having a sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

20 chromatography (Chiez, R.M. and FX Regnier (1990) Methods Enzymol. 1 82:392-421). The 
composition of die synthetic peptides maybe confirmed by amino acid analysis or by sequencing 
(Creighton, supra y pp. 28-53). 

In order to express a biologically active SECP, the polynucleotides encoding SECP or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 

25 the necessary elements for transcriptional and translational control of die inserted coding sequence in 
a suitable host These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotides encoding 
SECP. Such elements may vary in their strength and specificity. Specific initiation signals may also 
be used to achieve more efficient translation of polynucleotides encoding SECP. Such signals include 

30 the ATG initiation codon and adjacent sequences, e.g. die Kozak sequence. In cases where a 

polynucleotide sequence encoding SECP and its initiation codon and upstream regulatory sequences 
are inserted into the appropriate expression vector, no additional transcriptional or translational control 
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signals may be needed. However, in cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals including an in-frame ATG initiation codon should be 
provided by the vector. Exogenous translational elements and initiation codons may be of various 
origins, both natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 

5 enhancers appropriate for the particular host cell system used (Scharf, D. et aL (1994) Results ProbL 
Cell Differ. 20:125-162). 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing polynucleotides encoding SECP ami appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 

10 and in vivo genetic recombination (Sambrook and Russell, supra, ch. 1-4, and 8; Ausubel et aL, 
supra, ch. 1, 3, and 15). 

A variety of expression vector/host systems may be utilized to contain and express 
polynucleotides encoding SECP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; 

15 yeast transformed with yeast expression vectors; insect cell systems infected with viral expression 
vectors (e.g., baculovinis); plant cell systems transformed with viral expression vectors (e.g., 
cauliflower mosaic vims, CaMV, or tobacco mosaic vims, TMV) or with bacterial expression vectors 
(e.g., Ti or pBR322 plasmids); or animal cell systems (Sambrook and Russell, supra; Ausubel et aL, 
supra\ Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509; Engelhard, EK. et aL 

20 (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et aL (1996) Hum. Gene Ther. 7:1937- 
1945;Takamatsu, N. (1987) EMBO J. 6:307-311: The McGraw Hill Yearbook of Science and 
Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. 
NatL Acad. Sci. USA 813655-3659; Harrington, JJ. et aL (1997) Nat Genet 15345-355). 
Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or from 

25 various bacterial plasmids, may be used for delivery of polynucleotides to the targeted organ, tissue, or 
cell population (Di Nicola, M. et aL (1998) Cancer Gen. Ther. 5:350-356; Yu, M et aL (1993) Proc. 
NatL Acad. Sci. USA 90:6340-6344; BuDer, KM. et aL (1985) Nature 317:813-815; McGregor, D.P. 
et aL (1994) Mol. ImmunoL 31219-226; Verma, LM. and N. Somia (1997) Nature 389:239-242). The 
invention is not limited by the host cell employed. 

30 In bacterial systems, a number of cloning and expression vectors may be selected depending 

upon die use intended for polynucleotides encoding SECP. For example, routine cloning, subcloning, 
and propagation of polynucleotides encoding SECP can be achieved using a multifunctional & coli 
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vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid (Invitrogen). 
ligation of polynucleotides encoding SEGP into fee vector's multiple cloning site disrupts the lacL 
gene, allowing a colorimetric screening procedure for identification of transformed bacteria containing 
recombinant molecules. In addition, these vectors may be useful for in vitro transcription, dideoxy 

5 sequencing, single strand rescue wife helper phage, and creation of nested deletions in fee cloned 
sequence (Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509). When large 
quantities of SECP are needed, e.g. for the production of antibodies, vectors which direct high level 
expression of SECP may be used. For example, vectors containing fee strong, inducible SP6 or T7 
bacteriophage promoter may be used. 

10 Yeast expression systems may be used for production of SECP. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either fee secretion or intracellular retention of expressed proteins and enable integration 
of foreign polynucleotide sequences into fee host genome for stable propagation (Ausubel et aL, 

15 supra; Bitter, G.A. et al. (1987) Methods EnzymoL 153:516-544; Scorer, CA. et aL (1994) 
Bio/Technology 12:181-184). 

Plant systems may also be used for expression of SECP. Transcription of polynucleotides 
encoding SECP may be driven by viral promoters, e.g., fee 35S and 19S promoters of CaMV used 
alone or in combination wife the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 

20 6307-3 11). Alternatively, plant promoters such as fee small subunit of RUBISCO or heat shock 
promoters may be used (Coruzzi, G. et aL (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Science 224:838-843; Winter, J. et aL (1991) Results ProbL Cell Differ. 17:85-105). These constructs 
can be introduced into plant cells by direct DNA transformation or pathogen-mediated transaction 
(The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 

25 191-196). 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, polynucleotides encoding SECP may be ligated 
into an adenovirus transcription/translation complex consisting of fee late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of fee viral genome may be used to obtain 
30 infective virus which expresses SECP in host cells (Logan, J. and T. Shenk (1984) Proc. Natl. Acad. 
Sci. USA 81:3655-3659). In addition, transcription enhancers, such as fee Rous sarcoma virus (RSV) 
enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV-based vectors 
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may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may ako be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, potycationic amino polymers, 

5 or vesicles) for therapeutic purposes (Harrington, J.J. et al. (1997) Nat Genet 15:345-355). 

For long term production of recombinant proteins in mammalian systems, stable expression of 
SECP in cell lines is preferred. For example, polynucleotides encoding SECP can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 

10 introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 
selective agent, and its presence allows growth and recovery of cells which successfully express die 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 
. 15 Any number of selection systems may be used to recover transformed cell lines. These 

include, but are not limited to, die herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in rtr and apr cells, respectively (Wigler, M. et at (1977) 
Cell 11:223-232; Lowy, L et aL (1980) Cell 22:817-823). Also, antimetabolite, antibiotic, or herbicide 
resistance can be used as the basis for selection. For example, dhfr confers resistance to 

20 methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and ah and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Wigler, M. et al. 
(1980) Proc. Natl Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. MoL BioL 
150:1-14). Additional selectable genes have been described, e.g., trpB and hisD y which alter cellular 
requirements for metabolites (Hartman, S.C and R.C Mulligan (1988) Proc. Natl. Acad. Sci. USA 

25 85:8047-8051). Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), p- 
ghicuronidase and its substrate 0-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be u sed not only to identify transformants , but also to quantify die amount of 
transient or stable protein expression attributable to a specific vector system (Rhodes, C.A. (1995) 
Methods MoL Biol. 55:121-131). 

30 Although the presence/absence of marker gene expression suggests that the gene of interest 

is also present, the presence and expression of die gene may need to be confirmed. For example, if 
the sequence encoding SECP is inserted within a marker gene sequence, transformed cells containing 
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polynucleotides encoding SECP can be identified by the absence of marker gene function. 
Alternatively, a marker gene can be placed in tandem with a sequence encoding SECP under the 
control of a single promoter. Expression of the marker gene in response to induction or selection 
usually indicates expression of the tandem gene as well. 

S In general, host cells that contain the polynucleotide encoding SECP and that express SECP 

may be identified by a variety of procedures known to those of skill in the art Hiese procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein sequences. 

10 Immunological methods for detecting and measuring die expression of SECP using either 

specific polyclonal or monoclonal antibodies are known in die art Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on SECP is preferred, but a competitive 

15 binding assay may be employed. Tliese and other assays are well known in the art (Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual , APS Press, St Paul MN, Sect IV; CoHgan, 
J.E. et al. (1997) Current Protocols in Immunology . Greene Pub. Associates and Wiley-Interscienee, 
New York NY; Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ). 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 

20 may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding SECP include 
oligolabeHng, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, polynucleotides encoding SECP, or any fragments thereof, may be cloned into a vector 
for die production of an mRNA probe. Such vectors are known in die art, are commercially available, 

25 ami may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Biosciences, Promega (Madison 
WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease of 
detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 

30 well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with polynucleotides encoding SECP may be cultured under conditions 
suitable for die expression and recovery of die protein from cell culture. The protein produced by a 
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transformed cell may be secreted or retained intracelhilarly depending on the sequence and/or the 
vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode SECP may be designed to contain signal sequences which direct 
secretion of SECP through a prokaryotic or eukaryotic cell membrane. 

5 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted polynucleotides or to process the expressed protein in the desired fashion. Such modifications 
of die polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 

10 Different host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WD 8) are available from the 
American Type Culture Collection (ATCC, Manassas VA> and may be chosen to ensure die correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant polynucleotides 

15 encoding SECP may be ligated to a heterologous sequence resulting in translation of a fusion protein in 
any of the aforementioned host systems. For example, a chimeric SECP protein containing a 
heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of SECP activity. Heterologous protein and peptide 
moieties may also facilitate purification of fusion proteins using commercially available affinity 

20 matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 

binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-Ms enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 

25 proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between die SECP encoding sequence and die heterologous protein sequence, so that SECP 
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel et al. (supra, ch. 10 and 16). A variety of 

30 commercially available kits may also be used to facilitate expression and purification of fusion proteins. 
In another embodiment, synthesis of radiolabeled SECP may be achieved in vitro using die 
TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems couple 
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transcription and translation of protein-coding sequences operably associated with the T7, T3, or SP6 
promoters. Translation takes place in die presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

SECP, fragments of SECP, or variants of SECP may be used to screen for compounds that 

5 specifically bind to SECP. One or more test compounds may be screened for specific binding to 

SECP. In various embodiments, 1, 2, 3, 4, 5, 10, 20, 50, 100, or 200 test compounds can be screened 
for specific binding to SECP. Examples of test compounds can include antibodies, anticalins, 
oligonucleotides, proteins (e.g., Kgands or receptors), or small molecules. 

In related embodiments, variants of SECP can be used to screen for binding of test 

10 compounds, such as antibodies, to SECP, a variant of SECP, or a combination of SECP and/or one or 
more variants SECP. In an embodiment, a variant of SECP can be used to screen for compounds that 
bind to a variant of SECP, but not to SECP having die exact sequence of a sequence of SEQ ID 
NO:l-51. SECP variants used to perform such screening can have a range of about 50% to about 
99% sequence identity to SECP, with various embodiments having 60%, 70%, 75%, 80%, 85%, 90%, 

IS and 95% sequence identity. 

In an embodiment, a compound identified in a screen for specific binding to SECP can be 
closely related to the natural ligand of SECP, e.g., a ligand or fragment thereof, a natural substrate, a 
structural or functional mimetic, or a natural binding partner (Coligan, J.E. et al. (1991) Current 
Protocols in Immunology l(2):Chapter 5). In another embodiment, the compound thus identified can 

20 be a natural ligand of a receptor SECP (Howard, A.D. et aL (2001) Trends Pharmacol Sci.22:132- 
140; Wise, A. et al. (2002) Drug Discovery Today 7:235-246). 

In other embodiments, a compound identified in a screen for specific binding to SECP can be 
closely related to the natural receptor to which SECP binds, at least a fragment of the receptor, or a 
fragment of die receptor including all or a portion of die ligand binding site or binding pocket For 

25 example, die compound may be a receptor for SECP which is capable of propagating a signal, or a 
decoy receptor for SECP which is not capable of propagating a signal (Ashkenazi, A. and V.M. Divit 
(1999) Curr. Opin. Cell Biol 11:255-260; Mantovani, A. et al. (2001) Trends Immunol. 22:328-336). 
The compound can be rationally designed using known techniques. Examples of such techniques 
include those used to construct the compound etanercept (ENBREL; Amgen Inc., Thousand Oaks 

30 CA), which is efficacious for treating rheumatoid arthritis in humans. Etanercept is an engineered p75 
tumor necrosis factor (TNF) receptor dimer linked to the Fc portion of human IgG t (Taylor, P.C. et al. 
(2001) Curr. Opin. Immunol 13:611-616). 
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In one embodiment, two or more antibodies having similar or, alternatively, different 
specificities can be screened for specific binding to SECP, fragments of SECP, or variants of SECP. 
The binding specificity of the antibodies thus screened can thereby be selected to identify particular 
fragments or variants of SECP. In one embodiment, an antibody can be selected such that its binding 
5 specificity allows for preferential identification of specific fragments or variants of SECP. In another 
embodiment, an antibody can be selected such that its binding specificity allows for preferential 
diagnosis of a specific disease or condition having increased, decreased, or otherwise abnormal 
production of SECP. 

In an embodiment, anticaHns can be screened for specific binding to SECP, fragments of 
10 SECP, or variants of SECP. AnticaHns are ligand-binding proteins that have been constructed based 
on a lipocalin scaffold (Weiss, G.A. and HJB. Lowman (2000) Chem. BioL 7JU77-R184; Skerra, A. 
(2001) J. BiotechnoL 74:257-275). The protein architecture of tipocalins can include a beta-barrel 
having eight antiparallel beta-strands, which supports four loops at its open end. These loops form the 
natural ligand-binding site of the Hpocalins, a site which can be re-engineered in vitro by amino acid 
15 substitutions to impart novel binding specificities. The amino acid substitutions can be made using 
methods known in the art or described herein, and can include conservative substitutions (e.g., 
substitutions that do not alter binding specificity) or substitutions that modestly, moderately, or 
significantly alter binding specificity. 

In one embodiment, screening for compounds which specifically bind to, stimulate, or inhibit 
20 SECP involves producing appropriate cells which express SECP, either as a secreted protein or on the 
cell membrane. Preferred cells can include cells from mammals, yeast, Drosophila, or K colL Cells 
expressing SECP or cell membrane fractions which contain SECP are then contacted with a test 
compound and binding, stimulation, or inhibition of activity of either SECP or the compound is 
analyzed. 

25 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, the 
assay may comprise the steps of combining at least one test compound with SECP, either in solution 
or affixed to a solid support, and detecting the binding of SECP to the compound. Alternatively, the 
assay may detect or measure binding of a test compound in the presence of a labeled competitor. 

30 Additionally, the assay may be carried out using cell-free preparations, chemical libraries, or natural 
product mixtures, and die test compound(s) maybe free in solution or affixed to a solid support 

An assay can be used to assess the ability of a compound to bind to its natural ligand and/or to 
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inhibit die binding of its natural Hgand to its natural receptors. Examples of such assays include radio- 
labeling assays such as those described in U.S. Patent No. 5,914,236 and U.S. Patent No. 6372,724. 
In a related embodiment, one or more amino acid substitutions can be introduced into a polypeptide 
compound (such as a receptor) to improve or alter its ability to bind to its natural ligands (Matthews, 

5 DJ. and J. A. Wells. (1994) Chem. BioL 125-30). In another related embodiment, one or more amino 
acid substitutions can be introduced into a polypeptide compound (such as a ligand) to improve or alter 
its ability to bind to its natural receptors (Cunningham, B.C. and J.A. Wells (1991) Proc. NatL Acad. 
Sci. USA 88:3407-3411; Lowman, KB. et aL (1991) J. Biol Chem. 266:10982-10988). 

SECP, fragments of SECP, or variants of SECP may be used to screen for compounds that 

10 modulate the activity of SECP. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for SECP 
activity, wherein SECP is combined with at least one test compound, and the activity of SECP in die 
presence of a test compound is compared with the activity of SECP in the absence of the test 
compound. A change in the activity of SECP in the presence of the test compound is indicative of a 

15 compound that modulates the activity of SECP. Alternatively, a test compound is combined with an in 
vitro or cell-free system comprising SECP under conditions suitable for SECP activity, and the assay 
is performed. In either of these assays, a test compound which modulates the activity of SECP may 
do so indirectly and need not come in direct contact with the test compound. At least one and up to a 
plurality of test compounds may be screened. 

20 In another embodiment, polynucleotides encoding SECP or their mammalian homologs may be 

"knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in die art and are useful for the generation of animal models of 
human disease (see, e.g., U.S. Patent No. 5,175383 and U.S. Patent No. 5,767337). For example, 
mouse ES cells, such as the mouse 129/SvJ cell fine, are derived from the early mouse embryo and 

25 grown in culture. The ES cells are transformed with a vector containing the gene of interest disrupted 
by a marker gene, e.g., the neomycin phosphotransferase gene (neo\ Capecchi, MR. (1989) Science 
244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 

30 (1996) Clin. Invest 97:1999-2002; Wagner, ILU et aL (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and 
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the resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous 
strains. Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

Polynucleotides encoding SECP may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 

5 lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1145-1147). 

Polynucleotides encoding SECP can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 

10 of a polynucleotide encoding SECP is injected into animal ES cells, and die injected sequence 

integrates into the animal cell genome. Transformed cells are injected into blastulae, and die blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, 
a mammal inbred to overexpress SECP, e.g. , by secreting SECP in its milk, may also serve as a 

15 convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of SECP and secreted proteins. In addition, examples of tissues expressing SECP 
promonocyte line treated with PM A, peripheral blood mononuclear cells treated with interleukin 10, 

20 breast carcinoma and primary mammary epithelial cells, and normal and tumorous colon tissues, 
cancerous and normal breast, colon, lung, ovarian, prostate tissues, adipocytes, peripheral blood 
mononuclear cells, dendritic cells, monocytes, C3A hepatoma cell line, Raji B lymphoblast cells, and 
human umbilical vein endothelial cells. Further examples of tissues expressing SECP can be found in 
Table 6 and can also be found in Example XL Therefore, SECP appears to play a role in cell 

25 proliferative, autoimmune/inflammatory, cardiovascular, neurological, and developmental disorders. In 
the treatment of disorders associated with increased SECP expression or activity, it is desirable to 
decrease the expression or activity of SECP. In the treatment of disorders associated with decreased 
SECP expression or activity, it is desirable to increase the expression or activity of SECP. 

Therefore, in one embodiment, SECP or a fragment or derivative thereof may be administered 

30 to a subject to treat or prevent a disorder associated with decreased expression or activity of SECP. 
Examples of such disorders include, but are not limited to, a cell proliferative disorder such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
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(MCTD), myelofibrosis, paroxysmal nocturnal hemogtobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, a cancer of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 

5 lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, 
thyroid, and uterus; an autoimmune/inflammatory disorder such as acquired immunodeficiency 
syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, ankylosing 
spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 

10 cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes meHitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, 
atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, 

15 Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, 

systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, 
Werner syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, 
bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; a cardiovascular disorder 
such as congestive heart failure, ischemic heart disease, angina pectoris, myocardial infarction, 

20 hypertensive heart disease, degenerative valvular heart disease, calcific aortic valve stenosis, 

congenially bicuspid aortic valve, mitral annular calcification, mitral valve prolapse, rheumatic fever 
and rheumatic heart disease, infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis 
of systemic lupus erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, 
neoplastic heart disease, congenital heart disease, complications of cardiac transplantation, 

25 arteriovenous fistula, atherosclerosis, hypertension, vasculitis, Raynaud's disease, aneurysms, arterial 
dissections, varicose veins, thrombophlebitis and phkbothrombosis, vascular tumors, and complications 
of thrombolysis, balloon angioplasty, vascular replacement, and coronary artery bypass graft surgery; a 
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 
Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 

30 extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
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abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker 
syndrome, fatal familial insomnia, nutritional and metabolic diseases of the nervous system, 
neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, encephalotrigeminal 

5 syndrome, mental retardation and other developmental disorders of the central nervous system 
including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular 
disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 

10 mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; and a developmental disorder such as renal tubular acidosis, anemia, 
Cushing's syndrome, achondroplastic dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, 

15 gonadal dysgenesis, WAGR syndrome (Wilms" tumor, aniridia, genitourinary abnormalities, and mental 
retardation), Smith-Magenis syndrome, myelodysplastic syndrome, hereditary mucoepithelial dysplasia, 
hereditary keratodermas, hereditary neuropathies such as Charcot-Marie-Tooth disease and 
neurofibromatosis, hypothyroidism, hydrocephalus, seizure disorders such as Syndenham's chorea and 
cerebral palsy, spina bifida, anencephaly, craniorachischisis, congenital glaucoma, cataract, and 

20 sensorineural hearing loss. 

In another embodiment, a vector capable of expressing SECP or a fragment or derivative 
thereof maybe administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of SECP including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified SECP in 

25 conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent 
a disorder associated with decreased expression or activity of SECP including, but not limited to, those 
provided above. 

In still another embodiment, an agonist which modulates the activity of SECP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
30 activity of SECP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of SECP may be administered to a subject to treat or 
prevent a disorder associated with increased expression or activity of SECP. Examples of such 
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disorders include, but are not limited to, those cell proliferative, autoimmuneyinflammatory, 
cardiovascular, neurological, and developmental disorders described above. In one aspect, an antibody 
which specifically binds SECP may be used directly as an antagonist or indirectly as a targeting or 
delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express SECP. 

5 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding SECP may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of SECP including, but not limited to, those described above. 

In other embodiments, any protein, agonist, antagonist, antibody, complementary sequence, or 
vector embodiments may be administered in combination with other appropriate therapeutic agents. 

10 Selection of die appropriate agents for use in combination therapy may be made by one of ordinary 
skill in the art, according to conventional pharmaceutical principles. The combination of therapeutic 
agents may act synergisticaDy to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower dosages of 
each agent, thus reducing the potential for adverse side effects. 

15 An antagonist of SECP may be produced using methods which are generally known in die art 

In particular, purified SECP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind SECP. Antibodies to SECP may also 
be generated using methods that are well known in the art Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments 

20 produced by a Fab expression library. In an embodiment neutralizing antibodies (i.e., those which 
inhibit dimer formation) can be used therapeutically. Single chain antibodies (e.g., from camels or 
Hamas) may be potent enzyme inhibitors and may have application in die design of peptide mimetics, 
and in die development of immuDo- adsorbents and biosensors (Muyldermans, S. (2001) J. BiotechnoL 
74:277-302). 

25 For the production of antibodies, various hosts including goats, rabbits, rats, mice, camels, 

dromedaries, llamas, humans, and others may be immunized by injection with SECP or with any 
fragment or oligopeptide thereof which has immunogenic properties. Depending on die host species, 
various adjuvants may be used to increase immunological response. Such adjuvants include, but are 
not limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 

30 as lysolecithin, phironic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among 
adjuvants used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 
especially preferable. 
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It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to SECP 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
substantially identical to a portion of the amino acid sequence of the natural protein. Short stretches of 

5 SECP amino acids may be fused with those of another protein, such as KLH, and antibodies to die 
chimeric molecule may be produced. 

Monoclonal antibodies to SECP may be prepared using any technique which provides for die 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 

10 technique (Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 
81:31-42; Cote, RJ. et al. (1983) Proc. NatL Acad. ScL USA 805026-2030; Cok, S.P. et aL (1984) 
MoL Cell BioL 62:109-120). 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 

15 antigen specificity and biological activity, can be used (Morrison, S.L. et al. (1984) Proc. NatL Acad. 
Sci. USA 81:6851-6855; Neuberger, M.S. et aL (1984) Nature 312:604-608; Takeda, S. et al. (1985) 
Nature 3 14:452-454). Alternatively, techniques described for the production of single chain antibodies 
may be adapted, using methods known in the art, to produce SECP-specific single chain antibodies. 
Antibodies with related specificity, but of distinct idiotypic composition, may be generated by chain 

20 shuffling from random combinatorial immunoglobulin libraries (Burton, D.R. (1991) Proc. NatL Acad. 
Sci. USA 88:10134-10137). 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature (Qrlandi, R. et al. (1989) Proc. NatL Acad. Sci. USA 86:3833-3837; Winter, G. et al. 

25 (1991) Nature 349:293-299). 

Antibody fragments which contain specific binding sites for SECP may also be generated. 
For example, such fragments include, but are not limited to, FCab 1 )* fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab')2 fragments. Alternatively, Fab expression libraries maybe constructed to allow rapid and 

30 easy identification of monoclonal Fab fragments with the desired specificity (Huse, W.D. et al. (1989) 
Science 246:1275-1281). 

Various immunoassays may be used for screening to identify antibodies having the desired 
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specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
immunoassays typically involve the measurement of complex formation between SECP and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 

5 to two non-interfering SECP epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for SECP. Affinity is expressed as an association 
constant, K,, which is defined as die molar concentration of SECP-antibody complex divided by the 

10 molar concentrations of free antigen and free antibody under equilibrium conditions. The K, 

determined for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for 
multiple SECP epitopes, represents the average affinity, or avidity, of the antibodies for SECP. The 
K, determined for a preparation of monoclonal antibodies, which are monospecific for a particular 
SECP epitope, represents a true measure of affinity. High-affinity antibody preparations with K, 

15 ranging from about 10 9 to 10 n I/mole are preferred for use in immunoassays in which the SECP- 
antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations with K, 
ranging from about 10 6 to UP L/mole are preferred for use in immunopurification and similar 
procedures which ultimately require dissociation of SECP, preferably in active form, from the antibody 
(Catty, D. (1988) Antibodies, Volume I: A Practical Approach . IRL Press, Washington DC; Liddell, 

20 J.E. and A. Oyer (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, New York 
NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
25 specific antibody/ml, is generally employed in procedures requiring precipitation of SECP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available (Catty, supra; CoHgan et aL, 
supra). 

In another embodiment of die invention, polynucleotides encoding SECP, or any fragment or 
30 complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 
RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
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SECP. Such technology is well known in die art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
SECP (Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press, Totawa NJ). 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 

5 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

intraceDularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein (Slater, J.E. et 
aL (1998) J. Allergy Clin. ImmunoL 102:469-475; Scanlon, KJ. et al (1995) 9:1288-1296). Antisense 
sequences can also be introduced intraceDularly through the use of viral vectors, such as retrovirus and 

10 adeno-associated virus vectors (Miller, A.D. (1990) Blood 76:27 1 ; Ausubel et aL, supra; Uckert, W. 
and W. Wahher (1994) Pharmacol Ther. 63323-347). Other gene delivery mechanisms include 
liposome-derived systems, artificial viral envelopes, and other systems known in die art (Rossi, J. J. 
(1995) Br. Med. Bull. 51217-225; Boado, RJ. et al (1998) J. Pharm. Sci. 87:1308-1315; Morris, 
M.C. et aL (1997) Nucleic Acids Res. 252730-2736). 

15 In another embodiment of the invention, polynucleotides encoding SECP may be used for 

somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X- 
hnked inheritance (Cavazzana-Calvo, M. et al (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 

20 (Blaese, R.M. et al (1995) Science 270:475-480; Bordignon, C. et aL (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al (1993) Cell 75207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al (1995) Hum. Gene Therapy 6:667-703), thalassemias, familial 
hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor IX deficiencies (Crystal, 
RG. (1995) Science 270:404-410; Verma, LM. and N. Somia (1997) Nature 389:239-242)), (ii) 

25 express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescMa, E. et al (1996) Proc. Natl. Acad. Sci. USA 93:11395-11399), hepatitis 
B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 

30 brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzi)- In 
die case where a genetic deficiency in SECP expression or regulation causes disease, the expression 
of SECP from an appropriate population of transduced cells may alleviate the clinical manifestations 
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caused by die genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
SECP are treated by constructing mammalian expression vectors encoding SECP and introducing 
these vectors by mechanical means into SECP-deficient cells. Mechanical transfer technologies for 
5 use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and 
(v) the use of DNA transposons (Morgan, R.A. and WJF. Anderson (1993) Annu. Rev. Biochem. 
62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J.-L. and R R&apon (1998) Curr. Opin. 
Biotechnol. 9:445-450). 

10 Expression vectors that may be effective for the expression of SECP include, but are not 

limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX, PCR2-TOPOTA vectors 
(Invitrogen, Carlsbad CA), PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), 
and PTET-OFF, PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). SECP 
may be expressed using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous 

15 sarcoma virus (RSV), SV40 virus, thymidine kinase (TK), or ^-actin genes), (ii) an inducible promoter 
(e.g., the tetracycliiie-iegulated promoter (Gossen, M. and R Bujard (1992) Proc. Natl. Acad. Sci. 
USA 89:5547-5551; Gossen, M. etaL (1995) Science 268:1766-1769; Rossi, F.M.V. and RM. Blau 
(1998) Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); 
the ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 

20 FK506/rapamycin inducible promoter, or the RU4 8 6/mif epristone inducible promoter (Rossi, F.M. V. 
and RM. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding SECP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 

25 polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, RL. and A J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

30 In another embodiment of the invention, diseases or disorders caused by genetic defects with 

respect to SECP expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding SECP under the control of an independent promoter or the retrovirus long 
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terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cu-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, L et al. (1995) Proc. 

5 Nad. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 
receptors on die target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al 
(1987) J. ViroL 61:1647-1650; Bender, M.A. et aL (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. ViroL 62:3802-3806; DuD, T. et al (1998) J. Virol. 72:8463-8471; Zufferey, R. et 

10 al. (1998) J. Virol. 72:9873-9880). U.S. Patent No. 5,910,434 to Rigg ("Method for obtaining 

retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses 
a method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-ceBs), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 

15 therapy and have been well documented (Ranga, U. et al (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, ML. (1997) J. Virol 71:4707-4716; Ranga, U. et al. (1998) 
Proc. Natl Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In an embodiment, an adeno virus-based gene therapy delivery system is used to deliver 
polynucleotides encoding SECP to cells which have one or more genetic abnormalities with respect to 

20 the expression of SECP. The construction and packaging of adeno virus-based vectors are well 

known to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to 
be versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent No. 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 

25 hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al (1999; Annu. 
Rev. Nutr. 19:511-544) and Verma, LNL and N. Somia (1997; Nature 18:389:239-242). 

In another embodiment, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding SECP to target ceDs which have one or more genetic abnormalities with 
respect to the expression of SECP. Hie use of herpes simplex virus (HSV>based vectors may be 

30 especially valuable for introducing SECP to cells of die central nervous system, for which HSV has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in die art A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
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been used to deliver a reporter gene to the eyes of primates (Liu, X. et aL (1999) Exp. Eye Res. 
169:385-395). The construction of a HSV-1 virus vector has ako been disclosed in detail in U.S. 
Patent No. 5,804,413 to DeLuca ('Herpes simplex vinis strains for gene transfer")* which is hereby 
incorporated by reference. U.S. Patent No. 5,804,413 teaches the use of recombinant HSV d92 

5 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
die control of the appropriate promoter for purposes including human gene therapy. Also taught by 
this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 ami 
ICP22. For HSV vectors, see also Goins, W.F. et aL (1999; J. Virol. 73:519-532) and Xu, Ret al. 
(1994; Dev. Biol. 163:152-161). The manipulation of cloned herpesvirus sequences, the generation of 

10 recombinant virus following the transfection of multiple plasmids containing different segments of the 
large herpesvirus genomes, the growth and propagation of herpesvirus, and the infection of cells with 
herpesvirus are techniques well known to those of ordinary skin in die art 

In another embodiment, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding SECP to target cells. The biology of the prototypic alphavirus, 

IS Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than die full length genomic RNA, 
resulting in the overproduction of capsid proteins relative to die viral proteins with enzymatic activity 

20 (e.g., protease and polymerase). Similarly, inserting the coding sequence for SECP into die alphavirus 
genome in place of die capsid-coding region results in the production of a large number of SECP- 
coding RNAs and die synthesis of high levels of SECP in vector transduced cells. While alphavirus 
infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 

25 the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. et aL (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the 
introduction of SECP into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 

30 performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and + 10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
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be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of die ability of die double helix to open sufficiently for die binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature (Gee, J.R et aL (1994) in Huber, B.E. and B.I. Cair, Molecular and 

5 Immunologic Approaches. Futura Publishing, Mt Kisco NY, pp. 163-177). A complementary 

sequence or antisense molecule may also be designed to block translation of mRNA by preventing the 
transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 

10 molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of RNA molecules encoding SECP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 

15 GUU, and GUC. Once identified, short RNA sequences of between IS and 20 ribonucleotides, 
corresponding to die region of die target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonucfease protection assays. 

20 Complementary ribonucleic acid molecules and ribozymes may be prepared by any method 

known in the art for the synthesis of nucleic acid molecules. These include techniques for chemically 
synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. Alternatively, 
RNA molecules may be generated by in vitro and in vivo transcription of DNA molecules encoding 
SECP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable RNA 

25 polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitute vely or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, die addition of flanking sequences at the 5' and/or 3'ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 

30 within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, tiiio-, and similarly modified forms of adenine, cytidine, 
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guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

In other embodiments of the invention, the expression of one or more selected polynucleotides 
of the present invention can be altered, inhibited, decreased, or silenced using RNA interference 
(RNAi) or post-transcriptional gene silencing (PTGS) methods known in the art. RNAi is a post- 

5 transcriptional mode of gene silencing in which double-stranded RNA (dsRNA) introduced into a 
targeted cell specifically suppresses the expression of the homologous gene (i.e. , the gene bearing the 
sequence complementary to die dsRNA). This effectively knocks out or substantially reduces the 
expression of the targeted gene. PTGS can also be accomplished by use of DNA or DNA fragments 
as well. RNAi methods are described by Fire, A. et al. (1998; Nature 391:806-811) and Gura, T. 

10 (2000; Nature 404:804-808). PTGS can also be initiated by introduction of a complementary segment 
of DNA into the selected tissue using gene delivery and/or viral vector delivery methods described 
herein or known in the art 

RNAi can be induced in mammalian cells by the use of small interfering RNA also known as 
siRNA. SiRNA are shorter segments of dsRNA (typically about 21 to 23 nucleotides in length) that 

15 result in vivo from cleavage of introduced dsRNA by the action of an endogenous ribonuclease. 
SiRNA appear to be the mediators of the RNAi effect in mammals. The most effective siRNAs 
appear to be 21 nucleotide dsRNAs with 2 nucleotide 3' overhangs. The use of siRNA for inducing 
RNAi in mammalian cells is described by Elbashir, S.M. et aL (2001 ; Nature 41 1:494-498). 

SiRNA can either be generated indirectly by introduction of dsRNA into die targeted cell, or 

20 directly by mammalian transfection methods and agents described herein or known in the art (such as 
liposome-mediated transfection, viral vector methods, or other polynucleotide delivery/introductory 
methods). Suitable SiRNAs can be selected by examining a transcript of the target polynucleotide 
(e.g., mRNA) for nucleotide sequences downstream from the AUG start codon and recording die 
occurrence of each nucleotide and the 3' adjacent 19 to 23 nucleotides as potential siRNA target sites, 

25 with sequences having a 21 nucleotide length being preferred. Regions to be avoided for target 

siRNA sites include the 5' and 3* untranslated regions (UTRs) and regions near the start codon (within 
75 bases), as these may be richer in regulatory protein binding sites. UTR-binding proteins and/or 
translation initiation complexes may interfere with binding of the siRNP endonuclease complex. The 
selected target sites for siRNA can then be compared to the appropriate genome database (e.g., 

30 human, etc.) using BLAST or other sequence comparison algorithms known in the art Target 

sequences with significant homology to other coding sequences can be eliminated from consideration. 
The selected SiRNAs can be produced by chemical synthesis methods known in the art or by in vitro 
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transcription using commercially available methods and kits such as die SILENCER siRNA 
construction kit (Ambion, Austin XX). 

In alternative embodiments, long-term gene silencing and/or RNAi effects can be induced in 
selected tissue using expression vectors that continuously express siRNA. This can be accomplished 

5 using expression vectors that are engineered to express hairpin RNAs (shRNAs) using methods 

known in the art (see, e.g., Brummelkamp, T.R. et aL (2002) Science 296550-553; and Paddison, PJ. 
et aL (2002) Genes Dev. 16548-958). In these and related embodiments, shRNAs can be delivered to 
target cells using expression vectors known in the art An example of a suitable expression vector for 
delivery of siRNA is the PSELENCER1 .0-U6 (circular) plasmid (Ambion). Once delivered to die 

10 target tissue, shRNAs are processed in vivo into siRNA-like molecules capable of carrying out gene- 
specific silencing. 

In various embodiments, the expression levels of genes targeted by RNAi or PTGS methods 
can be determined by assays for mRNA and/or protein analysis. Expression levels of the mRNA of a 
targeted gene, can be determined by northern analysis methods using, for example, the 

15 NORTHERNMAX-GLY kit (Ambion); by microarray methods; by PCR methods; by real time PGR 
methods; and by other RNA/polynucleotide assays known in the art or described herein. Expression 
levels of the protein encoded by die targeted gene can be determined by Western analysis using 
standard techniques known in the art 

An additional embodiment of die invention encompasses a method for screening for a 

20 compound which is effective in altering expression of a polynucleotide encoding SECP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 

25 compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased SECP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding SECP may be therapeutically useful, and in die treatment of disorders associated with 
decreased SECP expression or activity, a compound which specifically promotes expression of the 

30 polynucleotide encoding SECP may be therapeutically useful. 

In various embodiments, one or more test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
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commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 

5 library of chemical compounds created combinatorial^ or randomly. A sample comprising a 

polynucleotide encoding SECP is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabihzed cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in die expression of a polynucleotide encoding SECP are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 

10 by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding SECP. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of die 

15 polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schiwsaccharomyces pombe gene expression system 
(Atkins, D. et aL (1999) U.S. Patent No. 5,932,435; Arndt, CM. et aL (2000) Nucleic Acids Res. 
28J515) or a human cell line such as HeLa cell (Clarke, M.L. et aL (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 

20 combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic 
acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide sequence 
(Bruice, T.W. et aL (1997) U.S. Patent No. 5,686,242; Bniice, T.W. et aL (2000) US. Patent No. 
6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
25 for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells 
taken from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in die art (Goldman, C.K. et aL (1997) Nat Biotechnol. 15:462- 
466). 

30 Any of die therapeutic methods described above maybe applied to any subject in need of 

such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 
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An additional embodiment of the invention relates to the administration of a composition which 
generally comprises an active ingredient formulated with a pharmaceutical^ acceptable excipient 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 

5 Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of SECP, 
antibodies to SECP, and mimetics, agonists, antagonists, or inhibitors of SECP. 

In various embodiments, the compositions described herein, such as pharmaceutical 
compositions, may be administered by any number of routes including, but not limited to, oral, 
intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, pulmonary, 

10 transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient In the 
case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast- 
acting formulations is well-known in the art In the case of macromolecules (e.g. larger peptides and 

15 proteins), recent developments in the field of pulmonary delivery via die alveolar region of the lung 
have enabled the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. 
et aL, U.S. Patent No. 5,997,848)* Pulmonary delivery allows administration without needle injection, 
and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 

20 ingredients are contained in an effective amount to achieve the intended purpose. The determination 
of an effective dose is well within die capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising SECP or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 

25 macromolecule. Alternatively, SECP or a fragment thereof may be joined to a short cationic N- 
tenninal portion from die HTV Tat-1 protein. Fusion proteins thus generated have been found to 
transduce into the cells of aD tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, die therapeutically effective dose can be estimated initially either in cell 

30 culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 
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administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example SECP 
or fragments thereof, antibodies of SECP, and agonists, antagonists or inhibitors of SECP, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 

5 standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 

calculating die ED*, (the dose therapeutically effective in 50% of die population) or LD 50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is die 
therapeutic index, which can be expressed as the LD 5( /ED 50 ratio. Compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 

10 used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED^ with little or no toxicity. 
The dosage varies within this range depending upon die dosage form employed, die sensitivity of die 
patient, and die route of administration. 

The exact dosage will be determined by die practitioner, in light of factors related to the 

15 subject requiring treatment Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect Factors which may be taken into account include the 
severity of the disease state, die general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combinations), reaction sensitivities, and response 
to therapy. Long-acting compositions may be administered every 3 to 4 days, every week, or 

20 biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 Mg to 100,000 Mg» up to a total dose of 
about 1 gram, depending upon die route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 

25 inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind SECP may be used for the 
diagnosis of disorders characterized by expression of SECP, or in assays to monitor patients being 
30 treated with SECP or agonists, antagonists, or inhibitors of SECP. Antibodies useful for diagnostic 
purposes may be prepared in die same manner as described above for therapeutics. Diagnostic 
assays for SECP include methods which utilize die antibody and a label to detect SECP in human body 
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fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covaknt or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring SECP, including EUSAs, RIAs, and FACS, are known 

5 in the art and provide a basis for diagnosing altered or abnormal levels of SECP expression. Normal 
or standard values for SECP expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibodies to SECP under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of SECP expressed in subject, 

10 control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of die invention, polynucleotides encoding SECP may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotides, complementary 
RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and quantify gene 

IS expression in biopsied tissues in which expression of SECP may be correlated with disease. The 

diagnostic assay may be used to determine absence, presence, and excess expression of SECP, and to 
monitor regulation of SECP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotides^ * 
including genomic sequences, encoding SECP or closely related molecules may be used to identify 

20 nucleic acid sequences which encode SECP. The specificity of the probe, whether it is made from a 
highly specific region, e.g., the 5* regulatory region, or from a less specific region, e.g., a conserved 
motif, and the stringency of the hybridization or amplification will determine whether die probe 
identifies only naturally occurring sequences encoding SECP, allelic variants, or related sequences. 
Probes may also be used for die detection of related sequences, and may have at least 50% 

25 sequence identity to any of die SECP encoding sequences. The hybridization probes of die subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO52-102 or 
from genomic sequences including promoters, enhancers, and introns of the SECP gene. 

Means for producing specific hybridization probes for polynucleotides encoding SECP include 
the cloning of polynucleotides encoding SECP or SECP derivatives into vectors for the production of 

30 mRNA probes. Such vectors are known in die art, are commercially available, and may be used to 
synthesize RNA probes in vitro by means of die addition of the appropriate RNA polymerases and 
the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter 
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groups, for example, by radionuclides such as **P or M S, or by enzymatic labels, such as alkaline 
phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotides encoding SECP may be used for the diagnosis of disorders associated with 
expression of SECP. Examples of such disorders include, but are not limited to, a cell proliferative 

5 disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia 
vera, psoriasis, primary thrombocythemia, and cancers including adenocarcinoma, leukemia, 
lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of die adrenal 
gland, bladder, bone, bone marrow, brain, breast, cervix, colon, gall bladder, ganglia, gastrointestinal 

10 tract, heart, kidney, liver, hing, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, 
skin, spleen, testis, thymus, thyroid, and uterus; an autoimmune/inflammatory disorder such as acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, allergies, 
ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, 
autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy 

15 (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus; emphysema, episodic lymphopenia with lymphocyte toxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 

syndrome, gout, Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, *- 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, 

20 osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic 
infections, and trauma; a cardiovascular disorder such as congestive heart failure, ischemic heart 

25 disease, angina pectoris, myocardial infarction, hypertensive heart disease, degenerative valvular heart 
disease, calcific aortic valve stenosis, congenitaDy bicuspid aortic valve, mitral annular calcification, 
mitral valve prolapse, rheumatic fever and rheumatic heart disease, infective endocarditis, nonbacterial 
thrombotic endocarditis, endocarditis of systemic lupus erythematosus, carcinoid heart disease, 
cardiomyopathy, myocarditis, pericarditis, neoplastic heart disease, congenital heart disease, 

30 complications of cardiac transplantation, arteriovenous fistula, atherosclerosis, hypertension, vasculitis, 
Raynaud's disease, aneurysms, arterial dissections, varicose veins, thrombophlebitis and 
phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon angioplasty, vascular 
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replacement, and coronary artery bypass graft surgery; a neurological disorder such as epilepsy, 
ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 

5 pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral 
meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 
thrombophlebitis, myelitis and radiculitis, viral centra] nervous system disease, prion diseases including 
kuru, Creutzfeldt-Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal familial 
insomnia, nutritional and metabolic diseases of die nervous system, neurofibromatosis, tuberous 

10 sclerosis, cerebelloretinal hemangjoblastomatosis, encephalotrigeminal syndrome, mental retardation 
and other developmental disorders of the central nervous system including Down syndrome, cerebral 
palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 

15 gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 
seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, Tourette's disorder, progressive 
supranuclear palsy, corticobasal degeneration, and familial frontotemporal dementia; and a 
developmental disorder such as renal tubular acidosis, anemia, Cushing's syndrome, achondroplastjc 

20 dwarfism, Duchenne and Becker muscular dystrophy, epilepsy, gonadal dysgenesis, WAGR syndrome 
(Wilms' tumor, aniridia, genitourinary abnormalities, and mental retardation), Smith-Magenis syndrome, 
myelodysplastic syndrome, hereditary mucoepithehal dysplasia, hereditary keratodermas, hereditary 
neuropathies such as Charcot-Marie-Tooth disease and neurofibromatosis, hypothyroidism, 
hydrocephalus, seizure disorders such as Syndenham's chorea and cerebral palsy, spina bifida, 

25 anencephaly, craniorachischisis, congenital glaucoma, cataract, and sensorineural hearing loss. 
Polynucleotides encoding SECP may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PGR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered SECP expression. 
Such qualitative or quantitative methods are well known in the art 

30 In a particular embodiment, polynucleotides encoding SECP may be used in assays that detect 

the presence of associated disorders, particularly those mentioned above. Polynucleotides 
complementary to sequences encoding SECP may be labeled by standard methods and added to a 
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fluid or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and die signal is quantified and 
compared with a standard value. If the amount of signal in die patient sample is significantly altered in 
comparison to a control sample then die presence of altered levels of polynucleotides encoding SECP 

5 in the sample indicates the presence of die associated disorder. Such assays may also be used to 
evaluate die efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, 
or to monitor the treatment of an individual patient 

In order to provide a basis for the diagnosis of a disorder associated with expression of SECP, 
a normal or standard profile for expression is established. This may be accomplished by combining 

10 body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding SECP, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing die values obtained from normal subjects with 
values from an experiment in which a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be compared with values obtained from samples from 

15 patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 

20 successive assays may be used to show die efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for die development 
of the disease, or may provide a means for detecting the disease prior to die appearance of actual 

25 clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier, thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding SECP 
may involve the use of PCR These oligomers may be chemically synthesized, generated 

30 enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide 
encoding SECP, or a fragment of a polynucleotide complementary to die polynucleotide encoding 
SECP, and will be employed under optimized conditions for identification of a specific gene or 
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condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or SNA sequences. 

In a particular aspect, oligonucleotide primers derived from polynucleotides encoding SECP 
may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitutions, insertions and 

5 deletions that are a frequent cause of inherited or acquired genetic disease in humans. Methods of 
SNP detection include, but are not limited to, single-stranded conformation polymorphism (SSCP) and 
fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from polynucleotides 
encoding SECP are used to amplify DNA using the polymerase chain reaction (PGR). The DNA 
may be derived, for example, from diseased or normal tissue, biopsy samples, bodily fluids, and the 

10 like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR products in 
single-stranded form, and these differences are detectable using gel electrophoresis in non-denaturing 
gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows detection of the 
amplimers in high-throughput equipment such as DNA sequencing machines. Additionally, sequence 
database analysis methods, termed in silico SNP (isSNP), are capable of identifying polymorphisms by 

15 comparing the sequence of individual overlapping DNA fragments which assemble into a common 
consensus sequence. These computer-based methods filter out sequence variations due to laboratory 
preparation of DNA and sequencing errors using statistical models and automated analyses of DNA 
sequence chromatograms. In the alternative, SNPs may be detected and characterized by mass 
spectrometry using, for example, the high throughput MASSARRAY system (Sequenom, Inc., San 

20 Diego CA). 

SNPs may be used to study the genetic basis of human disease. For example, at least 16 
common SNPs have been associated with non-insulin-dependent diabetes mellitus. SNPs are also 
useful for examining differences in disease outcomes in monogenic disorders, such as cystic fibrosis, 
sickle cell anemia, or chronic granulomatous disease. For example, variants in the mannose-binding 

25 lectin, MBL2, have been shown to be correlated with deleterious pulmonary outcomes in cystic 
fibrosis. SNPs also have utility in pharmacogenomics, the identification of genetic variants that 
influence a patient's response to a drug, such as life-threatening toxicity. For example, a variation in 
N-acetyl transferase is associated with a high incidence of peripheral neuropathy in response to the 
anti-tuberculosis drug isoniazid, while a variation in the core promoter of the ALOX5 gene results in 

30 diminished clinical response to treatment with an anti-asthma drug that targets the 5-Hpoxygenase 
pathway. Analysis of the distribution of SNPs in different populations is useful for investigating 
genetic drift, mutation, recombination, and selection, as well as for tracing the origins of populations 
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and their migrations (Taylor, J.G et al. (2001) Trends MoL Med. 7:507-512; Kwok, P.-Y. and Z. Gu 
(1999) MoL Med. Today 5:538-543; Nowotny, P. etal. (2001) Cuit. Opin. NeurobioL 11:637-641). 

Methods which may also be used to quantify the expression of SECP include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
5 standard curves (Melby, P.C et aL (1993) J. Immunol Methods 159:235-244; Duplaa, C. et al. (1993) 
Anal. Biochem. 212:229-236). The speed of quantitation of multiple samples maybe accelerated by 
running die assay in a high-throughput format where the oligomer or polynucleotide of interest is 
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

10 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotides described herein may be used as elements on a microarray. The micro array can be 
used in transcript imaging techniques which monitor the relative expression levels of large numbers of 
genes simultaneously as described below. The microarray may also be used to identify genetic 
variants, mutations, and polymorphisms . This information may be used to determine gene function, to 

15 understand die genetic basis of a disorder, to diagnose a disorder, to monitor pmgression/regression of 
disease as a function of gene expression, and to develop and monitor the activities of therapeutic 
agents in the treatment of disease. In particular, this information may be used to develop a 
pharmacogenomic profile of a patient in order to select the most appropriate and effective treatment 
regimen for that patient For example, therapeutic agents which are highly effective and display the 

20 fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, SECP, fragments of SECP, or antibodies specific for SECP may be 
used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

25 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at 
a given time (Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent No. 5,840,484; 
hereby expressly incorporated by reference herein). Thus a transcript image may be generated by 

30 hybridizing the polynucleotides of die present invention or their complements to die totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
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invention or their complements comprise a subset of a plurality of elements on a micro array. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the 

5 case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 

10 molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et aL (1999) MoL Caicinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) 
Toxicol. Lett 112-113:467-471). If a test compound has a signature similar to that of a compound 
with known toxicity, it is likely to share those toxic properties. These fingerprints or signatures are 
most useful and refined when they contain expression information from a large number of genes and 

15 gene families. Ideally, a genome-wide measurement of expression provides the highest quality 
signature. Even genes whose expression is not altered by any tested compounds are important as 
well, as die levels of expression of these genes are used to normalize the rest of the expression data. 
Hie normalization procedure is useful for comparison of expression data after treatment with different 
compounds. While the assignment of gene function to elements of a toxicant signature aids in 

20 interpretation of toxicity mechanisms, knowledge of gene function is not necessary for the statistical 
matching of signatures which leads to prediction of toxicity (see, for example, Press Release 00-02 
from the National Institute of Environmental Health Sciences, released February 29, 2000, available at 
http^/www.niehs JMh.gov/oc/newsAoxchipJitm). Therefore, it is important and desirable in 
toxicological screening using toxicant signatures to include all expressed gene sequences. 

25 In an embodiment, the toxicity of a test compound can be assessed by treating a biological 

sample containing nucleic acids with die test compound. Nucleic acids that are expressed in the 
treated biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present invention 
may be quantified. The transcript levels in the treated biological sample are compared with levels in 

30 an untreated biological sample. Differences in fee transcript levels between fee two samples are 
indicative of a toxic response caused by the test compound in the treated sample. 

Another embodiment relates to die use of fee polypeptides disclosed herein to analyze the 
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proteome of a tissue or cell type. Hie term proteome refers to the global pattern of protein expression 
in a particular tissue or cell type. Each protein component of a proteome can be subjected individually 
to further analysis. Proteome expression patterns, or profiles, are analyzed by quantifying the number 
of expressed proteins and their relative abundance under given conditions and at a given time. A 

5 profile of a cell's proteome may thus be generated by separating and analyzing die polypeptides of a 
particular tissue or cell type. In one embodiment, the separation is achieved using two-dimensional gel 
electrophoresis, in which proteins from a sample are separated by isoelectric focusing in the first 
dimension, and then according to molecular weight by sodium dodecyl sulfate slab gel electrophoresis 
in the second dimension (Steiner and Anderson, supra). The proteins are visualized in the gel as 

10 discrete and uniquely positioned spots, typically by staining the gel with an agent such as Coomassie 
Blue or silver or fluorescent stains. The optical density of each protein spot is generally proportional to 
the level of the protein in the sample. The optical densities of equi vafently positioned protein spots 
from different samples, for example, from biological samples either treated or untreated with a test 
compound or therapeutic agent, are compared to identify any changes in protein spot density related to 

15 the treatment The proteins in die spots are partially sequenced using, for example, standard methods 
employing chemical or enzymatic cleavage followed by mass spectrometry. The identity of the protein 
in a spot may be determined by comparing its partial sequence, preferably of at least 5 contiguous 
amino acid residues, to the polypeptide sequences of interest In some cases, further sequence data 
may be obtained for definitive protein identification. 

20 A proteomic profile may also be generated using antibodies specific for SECP to quantify the 

levels of SECP expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing die microarray to the sample and detecting 
the levels of protein bound to each array element (Lucking, A. et al. (1999) Anal Biochem. 270:103- 
111; Mendoze, L.G. et aL (1999) Biotechniques 27:778-788). Detection may be performed by a 

25 variety of methods known in the art, for example, by reacting the proteins in die sample with a thiol- or 
amino-reacdve fluorescent compound and detecting the amount of fluorescence bound at each array 
element 

Toxicant signatures at die proteome level are also useful for toxicological screening, and 
should be analyzed in parallel with toxicant signatures at the transcript leveL There is a poor 
30 correlation between transcript and protein abundances for some proteins in some tissues (Anderson, 
N.L. and J. Seilhamer (1997) Electrophoresis 18533-537), so proteome toxicant signatures may be 
useful in the analysis of compounds which do not significantly affect the transcript image, but which 
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alter die proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated 

5 biological sample are separated so that the amount of each protein can be quantified. The amount of 
each protein is compared to die amount of die corresponding protein in an untreated biological sample. 
A difference in the amount of protein between the two samples is indicative of a toxic response to die 
test compound in the treated sample. Individual proteins are identified by sequencing the amino acid 
residues of the individual proteins and comparing these partial sequences to die polypeptides of the 

10 present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with die test compound. Proteins from the biological sample are incubated 
with antibodies specific to die polypeptides of die present invention. The amount of protein recognized 
by the antibodies is quantified. Hie amount of protein in the treated biological sample is compared 

15 with die amount in an untreated biological sample. A difference in the amount of protein between the 
two samples is indicative of a toxic response to die test compound in the treated sample. 

Micro arrays may be prepared, used, and analyzed using methods known in the art (Brennan, 
T.M. et aL (1995) U.S. Patent No. 5,474,796; Schena, M. et aL (1996) Proc. Nad. Acad. Sci. USA 
93:10614-10619; Baldeschweifer et aL (1995) PCT application W095/25116; Shaten, D. et aL (1995) 

20 PCT application WO95/35505; Heifer, R.A. et aL (1997) Proc. NatL Acad. Sci. USA 94:2150-2155; 
Heller, M J. et aL (1997) U.S. Patent No. 5,605,662). Various types of microarrays are well known 
and thoroughly described in Schena, M., ed. (1999; DNA Microarravs: A Practical Approach . Oxford 
University Press, London). 

In another embodiment of the invention, nucleic acid sequences encoding SECP may be used 

25 to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences maybe used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 

30 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries (Harrington, J J. et aL (1997) Nat Genet 15:345- 
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355; Price, CM. (1993) Blood Rev. 7:127-134; Trask, B J. (1991) Trends Genet 7:149-154). Once 
mapped, the nucleic acid sequences may be used to develop genetic linkage maps, for example, which 
correlate die inheritance of a disease state with the inheritance of a particular chromosome region or 
restriction fragment length polymorphism (RFLP) (Lander, E.S. and D. Botstein (1986) Proc. Natl. 

5 Acad. Sci. USA 83:7353-7357). 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data (Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968). Examples of genetic map data 
can be found in various scientific journals or at die Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding SECP on a physical 

10 map and a specific disorder, or a predisposition to a specific disorder, may help define die region of 
DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gem on the chromosome of another mammalian species, such as mouse, 

15 may reveal associated markers even if die exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any 
sequences mapping to that area may represent associated or regulatory genes for further investigation 

20 (Gatti, R.A. et aL (1988) Nature 336:577-580). The nucleotide sequence of the instant invention may 
also be used to detect differences in die chromosomal location due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

In another embodiment of the invention, SECP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drag 

25 screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracelhilarly. The formation of binding complexes 
between SECP and the agent being tested may be measured. 

Another technique for drag screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (Geysen, et al. (1984) PCT application 

30 WO84/03564). In this method, large numbers of different small test compounds are synthesized on a 
solid substrate. The test compounds are reacted with SECP, or fragments thereof, and washed. 
Bound SECP is then detected by methods well known in the art Purified SECP can also be coated 
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directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding SECP specifically compete with a test compound for binding SECP. In 
5 this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with SECP. 

In additional embodiments, the nucleotide sequences which encode SECP may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
10 properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent The following embodiments are, therefore, 
to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever. 

15 The disclosures of all patents, applications, and publications mentioned above and below, 

including US. Ser. No. 60/357,002, U.S. Ser. No. 60/362,439, and U.S. Ser. No. 60/366,041, are 
hereby expressly incorporated by reference. 

EXAMPLES 

20 I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in die UFESEQ GOLD 
database (Incyte Genomics, Palo Alto CA). Some tissues were homogenized and lysed in guanidinhim 
isothiocyanate, while others were homogenized and lysed in phenol or in a suitable mixture of 
denaturants, such as TRIZOL (Invitrogen), a monophasic solution of phenol and guanidine 

25 isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted with 

chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was 

30 isolated using oHgo d(T)-coupled paramagnetic particles (Promega), OLK5OTEX latex particles 
(QIAGEN, Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, 
RNA was isolated directly from tissue lysates using other RNA isolation kits, e.g., the 
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POLY(A)PURE mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed tbe corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasmid system (Invitrogen), using the 

5 recommended procedures or similar methods known in the ait (Ausubel et a]., supra, ch. 5). Reverse 
transcription was initiated using oHgo d(T) or random primers. Synthetic oligonucleotide adapters were 
ligated to double stranded cDNA, ami the cDNA was digested with the appropriate restriction enzyme 
or enzymes. For most libraries, die cDNA was size-selected (300-1000 bp) using SEPHACRYL 
S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column chromatography (Amersham 

10 Biosciences) or preparative agarose gel electrophoresis. cDNAs were ligated into compatible 
restriction enzyme sites of the polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid 
(Stratagene), PSPORT1 plasmid (Invitrogen, Carlsbad CA), PCDNA2.1 plasmid (Invitrogen), PBK- 
CMV plasmid (Stratagene), PCR2-TOPOTA plasmid (Invitrogen), PCMV-IOS plasmid (Stratagene), 
pIGEN (Incyte Genomics, Palo Alto CA), pRARE (Incyte Genomics), or pINCY (Incyte Genomics), 

15 or derivatives thereof. Recombinant plasmids were transformed into competent £ coli cells including 
XLl-Blue, XLl-BhieMRF, or SOLR from Stratagene or DHSa, DH10B, or ElectroMAX DH10B 
from Invitrogen. 

II. Isolation of eDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo 
20 excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Phis Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 
96 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 
25 ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PGR in a 
high-throughput format (Rao, V.B. (1994) Anal Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 
3 84- well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
30 PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 
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Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation such 
as die ABI CATALYST 800 (Applied Biosystems) thermal cycler or the FTC-200 thermal cycler 
(MJ Research) in conjunction with die HYDRA microdispenser (Robbins Scientific) or the 

5 MICROLAB 2200 (Hamilton) liquid transfer system cDNA sequencing reactions were prepared 
using reagents provided by Amersham Biosciences or supplied in ABI sequencing kits such as the 
ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides 
were carried out using the MEG AB ACE 1000 DNA sequencing system (Amersham Biosciences); 

10 the ABI PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard 
ABI protocols and base calling software; or otter sequence analysis systems known in the ait. 
Reading frames within the cDNA sequences wore identified using standard methods (Ausubel et aL, 
supra, ch 7). Some of the cDNA sequences were selected for extension using die techniques 
disclosed in Example VUL 

15 The polynucleotide sequences derived from Incyte cDNAs were validated by removing 

vector, linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and 
programs based on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The 
Incyte cDNA sequences or translations thereof were then queried against a selection of public 
databases such as die GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and 

20 BLOCKS, HUNTS, DOMO, PRODOM; PROTEOME databases with sequences from Homo 
sapiens, Rattus norvegicus, Mus museidus, Caenorhabditis elegans, Saccharomyces cerevisiae, 
Schizosaccharomyces pombe y and Candida albicans (Incyte Genomics, Palo Alto CA); hidden 
Markov model (HMM>based protein family databases such as PFAM, INCY, and T1GRFAM (Haft, 
D.H. et aL (2001) Nucleic Acids Res. 29:41-43); and HMM-based protein domain databases such as 

25 SMART (Schuhz, J. et aL (1998) Proc. Natl Acad. Sci. USA 95:5857-5864; Letunic, I. et aL (2002) 
Nucleic Acids Res. 30:242-244). (HMM is a probabilistic approach which analyzes consensus 
primary structures of gene families; see, for example, Eddy, S R. (1996) Curr. Opin. Struct Biol. 
6:361-365.) The queries were performed using programs based on BLAST, FASTA, BLIMPS, and 
HMMER The Incyte cDNA sequences were assembled to produce frill length polynucleotide 

30 sequences. Alternatively, GenBank cDNAs, GenBank ESTs, stitched sequences, stretched 

sequences, or Genscan-predicted coding sequences (see Examples IV and V) were used to extend 
Incyte cDNA assemblages to full length. Assembly was performed using programs based on Phred, 
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Fhrap, and Consed, and cDNA assemblages were screened for open reading frames using programs 
based on GeneMark, BLAST, and FASTA. The full length polynucleotide sequences were translated 
to derive the corresponding full length polypeptide sequences. Alternatively, a polypeptide may begin 
at any of die methionine residues of the full length translated polypeptide. Full length polypeptide 

5 sequences were subsequently analyzed by querying against databases such as the GenBank protein 
databases (genpept), SwissProt, the PROTEOME databases, BLOCKS, PRINTS, DOMO, 
PRODOM, Prosite, hidden Markov model (HMM>based protein family databases such as PFAM, 
INCY, and T1GRFAM; and HMM-based protein domain databases such as SMART. Full length 
polynucleotide sequences are also analyzed using MACDNASIS PRO software (MiraiBio, Alameda 

10 CA) and LASERGENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments 
are generated using default parameters specified by die CLUSTAL algorithm as incorporated into the 
MEGALIGN multi sequence alignment program (DNASTAR), which also calculates die percent 
identity between aligned sequences. 

Table 7 summarizes die tools, programs, and algorithms used for the analysis and assembly of 

IS Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
which are incorporated by reference herein in their entirety, and the fourth column presents, where 
applicable, the scores, probability values, and other parameters used to evaluate the strength of a 

20 match between two sequences (the higher the score or die lower die probability value, die greater the 
identity between two sequences). 

The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:52-102. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 

25 amplification technologies are described in Table 4, column 2. 

TV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative secreted proteins were initially identified by running the Genscan gene identification 
program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is a general- 
purpose gene identification program which analyzes genomic DNA sequences from a variety of 

30 organisms (Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94; Burge, C. and S. Karlin (1998) 
Curr. Opin. Struct Biol. 8346-354). The program concatenates predicted exons to form an 
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan is a 
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FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA 
sequences encode secreted proteins, the encoded polypeptides were analyzed by querying against 
PFAM models for secreted proteins. Potential secreted proteins were also identified by homology to 

5 Incyte cDNA sequences that had been annotated as secreted proteins. These selected Genscan- 
predicted sequences were then compared by BLAST analysis to the genpept and gbpri public 
databases. Where necessary, the Genscan-predicted sequences were then edited by comparison to 
the top BLAST hit from genpept to correct errors in the sequence predicted by Genscan, such as 
extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or public cDN A 

10 coverage of die Genscan-predicted sequences, thus providing evidence for transcription. When Incyte 
cDNA coverage was available, tins information was used to correct or confirm the Genscan predicted 
sequence. Full length polynucleotide sequences were obtained by assembling Genscan-predicted 
coding sequences with Incyte cDNA sequences and/or public cDNA sequences using the assembly 
process described in Example m. Alternatively, full length polynucleotide sequences were derived 

15 entirely from edited or unedited Genscan-predicted coding sequences. 

V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene 
identification program described in Example IV. Partial cDNAs assembled as described in Example 

20 HI were mapped to genomic DNA and parsed into clusters containing related cDNAs and Genscan 
exon predictions from one or more genomic sequences. Each cluster was analyzed using an algorithm 
based on graph theory and dynamic programming to integrate cDN A and genomic information, 
generating possible splice variants that were subsequently confirmed, edited, or extended to create a 
full length sequence. Sequence intervals in which die entire length of the interval was present on 

25 more than one sequence in the cluster were identified, and intervals thus identified were considered to 
be equivalent by transitivity. For example, if an interval was present on a cDNA and two genomic 
sequences, then all three intervals were considered to be equivalent This process allows unrelated 
but consecutive genomic sequences to be brought together, bridged by cDNA sequence. Intervals 
thus identified were then "stitched" together by the stitching algorithm in die order that they appear 

30 along their parent sequences to generate the longest possible sequence, as well as sequence variants, 
linkages between intervals which proceed along one type of parent sequence (cDNA to cDNA or 
genomic sequence to genomic sequence) were given preference over linkages which change parent 
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type (cDNA to genomic sequence). The resultant stitched sequences were translated and compared 
by BLAST analysis to the genpept and gbpri public databases. Incorrect exons predicted by Genscan 
were corrected by comparison to die top BLAST bit from genpept Sequences were further extended 
with additional cDNA sequences, or by inspection of genomic DNA, when necessary. 

5 "Stretched" Sequences 

Partial DNA sequences were extended to full length with an algorithm based on BLAST 
analysis. First, partial cDNAs assembled as described in Example III were queried against public 
databases such as die GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases 
using the BLAST program. The nearest GenBank protein homolog was then compared by BLAST 

10 analysis to either Incyte cDNA sequences or GenScan exon predicted sequences described in 
Example IV. A chimeric protein was generated by using the resultant high-scoring segment pairs 
(HSPs) to map the translated sequences onto die GenBank protein homolog. Insertions or deletions 
may occur in the chimeric protein with respect to the original GenBank protein homolog. The 
GenBank protein homolog, the chimeric protein, or both were used as probes to search for homologous 

15 genomic sequences from the public human genome databases. Partial DNA sequences were 

therefore "stretched" or extended by the addition of homologous genomic sequences. The resultant 
stretched sequences were examined to determine whether it contained a complete gene. 
VI. Chromosomal Mapping of SECP Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:52-102 were compared with 

20 sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:52-102 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 

25 Genome Research (WIGR), and Genethon were used to determine if any of die clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 

30 arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM distances 
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are based on genetic markers mapped by Genethon which provide boundaries for radiation hybrid 
markers whose sequences were included in each of the clusters. Human genome maps and other 
resources available to the public, such as the NCBI "GeneMap*99" World Wide Web site 
(http^/www.ncbi.nlnxnih.gov/genemap/) 9 can be employed to determine if previously identified disease 

5 genes map within or in proximity to die intervals indicated above. 
Association of SECP polynucleotides with Lung Cancer 

Heritable forms of lung carcinoma have not been reported and thus, identification of relevant 
disease-associated genes through conventional linkage analysis is not possible. However, several 
studies of sporadic nonsmall cell lung carcinoma (NSCLC) tumors have reported loss of 

10 heterozygosity (LOH) in regions of chromosome 1 1 suggesting the presence of one or more tumor 
suppressor genes (Bepler, G. and Garcia-Blanco, M.A. (1994) Proa Natl Acad. Sci. 915513-5517; 
Iizuka, M. (1995) Genes, Chromosomes & Cancer 13:40-46; Rasio, D. (1995) Cancer Research 
55:3988-91). In a study of 79 patients with lung cancer, Iizuka and coworkers found feat llql4- 
llq24.2 was. deleted in many of die lung tumors studied. Mapping of this region with additional 

15 markers showed that the region of chromosome llq bounded by markers D11S939 and D11S938 was 
commonly deleted (Iizuka, et al., supra). In another study it was shown that human A549 NSCLC 
cells transformed with a human-derived YAC clone containing a region of chromosome llq within die 
region bounded by D11S939 and D11S938, exhibited little or no increase in cell number (versus control 
cells whose number increased 5-10-fold in die same 5 day period). Further studies of these hybrid 

20 cells showed a decrease in tumorigenicity and an increase in latency following injection into athymic, 
nude mice, as compared with mice injected with control A549 cells. These studies suggest the 
presence of a tumor suppressor gene within this region of chromosome llq and support the 
association of LOH in this region with NSCLC. 

Restriction fragment length polymorphism (RFLP) markers shown to be near regions of DNA 

25 known as sequence-tagged sites (STS), have been mapped to NT_Contigs generated by die Human 
Genome Project using ePCR (Schuler, GJX (1997) Genome Research 7: 541-550, and (1998) Trends 
BiotechnoL 16(ll):456-459). Contigs containing regions of DNA with known disease-associated 
markers are therefore used to identify SECP sequences that map to disease-associated regions of the 
genome. 

30 Polynucleotides encoding SECP were mapped to NTjContigs. Contigs longer than 1Mb were 

broken into subcontigs of 1Mb length with overlapping sections of lOOkb. A preliminary step used an 
algorithm, similar to MEG ABLAST, to define the mRNA sequence /masked genomic DNA contig 
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pairings. The cDNA/gcnomic pairings identified by the first algorithm were confirmed, and the SECP 
polynucleotides mapped to DNA contigs, using SIM4 (Florea, L. et al. (1998) Genome Res. 8:967-74, 
version May 2000) which had been optimized for high throughput processing and strand assignment 
confidence). The SIM4 output of the mRNA sequence/genomic contig pairs was further processed to 
5 determine the correct location of the SECP polynucleotides on the genomic contig, as well as their 
strand identity. 

SEQ ID NO:63 was mapped to NT_Contig NT_009151_021.8 from Genbank, version 128, 
covering a 5.5 Mb region of the genome that also contains lung cancer-associated genetic markers 
D11S939 and D11S938. The maximum distance between SEQ ID NO:63 and markers D11S939 and 

10 D11S938, therefore, is 5.5 Mb. Thus, SEQ ID NO:63 is in proximity with genetic markers shown to 
consistently associate with lung cancer. Therefore, in various embodiments, SEQ ID NO:63 can be 
used for one or more of die following: i) determination of LOH in persons with lung cancer in the lung 
cancer disease region at 1 lql2-24.2, ii) diagnostic assays for hing cancer, and iii) developing 
therapeutics and/or other treatments for lung cancer. 

15 SEQ ID NO:64 was mapped to NT_Contig NT_009151_021.8 from Genbank, version 128, 

covering a 5.5 Mb region of the genome that also contains lung cancer-associated genetic markers 
D11S939 andDHS938. The maximum distance between SEQ ID NO:64 and markers D11S939 and 
Dl 1S938, therefore, is 5.5 Mb. Thus, SEQ ID NO:64 is in proximity with genetic markers shown to 
consistently associate with lung cancer. Therefore, in various embodiments, SEQ ID NO:64 can be 

20 used for one or more of the following: i) determination of LOH in persons with lung cancer in the lung 
cancer disease region at llql2-24.2, ii) diagnostic assays for hing cancer, and iii) developing 
therapeutics and/or other treatments for lung cancer. 
Association of SECP polynucleotides wife Alzheimer's Disease 

SECP polynucleotides were mapped to NTjContigs, available from NCBI, using the following 

25 procedures. Contigs longer than 1Mb were broken into subcontigs of 1Mb in length wife overlapping 
sections of lOOkb. A preliminary step used an algorithm, similar to MEGABLAST, to define fee 
mRNA sequence/masked genomic DNA contig pairings. The cDN A/genomic pairings identified by 
the first algorithm were run through Sim4 (Florea, L. et al. (1998) Genome Res. 8:967-74, version May 
2000) feat had been optimized in house for high throughpu t and strand assignment confidence). The 

30 Sim4 output of fee mRNA sequence/genomic contig pairs was further processed to determine fee 
correct location of fee SECP polynucleotides on fee genomic contig, and also their strand identity. 

Loci on chromosomes feat map to regions associated wife particular diseases can be used as 
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markers for these particular diseases. These markers then can be used to develop diagnostic and 
therapeutic tools for these diseases. For example, loci on chromosome 10 are associated with or 
linked to Alzheimer's disease (AD), a progressive neurodegenerative disease that represents the most 
common form of dementia (Ait-Ghezala, G. et al. (2002) Neurosci Lett 325:87-90). AD can be 

5 inherited as an autosomal dominant trait Further, genetic studies have focused on identification of 
genes that are potential targets for new treatments or improved diagnostics. The deposition and 
aggregation of p-amyloid in specific regions of the brain are key neuropathological hallmarks of AD. 
Insulin-degrading enzyme (DDE) can degrade ^-amyloid Abraham, R. et al. (2001) Hum Genet 
109:646-652). The IDE gene has been mapped near an AD-associated locus, 10q23-q25 (Espinosa R. 

10 3 1 * et al. (1991) Cytogenet Cell Genet 57:184-186). Linkage analysis using IDE gene markers was 
performed on 1426 subjects from 435 families in which at least two family members were affected 
with AD. 

A logarithm of the odds ratio for linkage (tod) score of over 3 indicates a probability of 1 in 
1000 that a particular marker was found solely by chance in affected individuals. Significant linkage 

15 (lod score of 3.3) was reported between the polymorphic marker D10S583, located at 1 15.3 cM on 
chromosome 10, and AD with age of onset 2>50 years (Betram, L. et al. (2000) Science 290:2302- 
2303). D10S583 maps 36 kb upstream of the IDE gene. Further analysis of this region, however, 
foiled to show association of SNPs (single nucleotide polymorphisms) within die IDE gene and 
flanking regions with late-onset AD (LOAD), in a study of 134 Caucasian LOAD cases and 111 

20 matched controls from the United Kingdom (Abraham, R. et al, supra). Thus, although the activity of 
IDE may not influence die susceptibility to LOAD, there is substantial linkage in the chromosomal 
region containing the IDE gene, marker D10S583, and AD. The IDE gene and D10S583 both map to 
contig NT_008769, which contains a region of chromosome 10 that is 9. 16 Mb in size. 

SEQ ID NO:61 mapped to a region of contig NT_008804__002.8 from GenBank (version 128), 

25 localizing SEQ ID NO:61 to within 9.16 Mb of die Alzheimer's disease locus on chromosome lOq. 
Thus, SEQ ID NO:61 is in proximity with loci shown to consistently associate with Alzheimer's 
disease. Therefore, in various embodiments, SEQ ID NO:61 can be used for one or more of die 
following: i) linkage analysis of persons and/or families to the AD disease region at lOq, ii) diagnostic 
assays for AD, and iii) developing therapeutics and/or other treatments for AD. 

30 VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RN As 
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from a particular cell type or tissue have been bound (Sambrook and Russell, supra, ch. 7; Ausubel et 
al., supra y ch. 4). 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in databases such as GenBank or OFESEQ (Incyte Genomics). This analysis is much 
5 faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer search 
can be modified to determine whether any particular match is categorized as exact or similar. Hie 
basis of the search is die product score, which is defined as: 

BLAST Score x Percent Identity 
10 5 x minimum {length(Seq. 1), lengtb(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 

15 product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 
(HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by 
gaps). If there is more than one HSP, then the pair with the highest BLAST score is used to calculate 
the product score. The product score represents a balance between fractional overlap and quality in a 

20 BLAST alignment For example, a product score of 100 is produced only for 100% identity over die 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 
either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at die 
other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

25 Alternatively, polynucleotides encoding SECP are analyzed with respect to the tissue sources 

from which they were derived. For example, some full length sequences are assembled, at least in 
part, with overlapping Incyte cDNA sequences (see Example HI). Each cDNA sequence is derived 
from a cDNA library constructed from a human tissue. Each human tissue is classified into one of the 
following organ/tissue categories: cardiovascular system; connective tissue; digestive system; 

30 embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; 
hemic and immune system; liver, musculoskeletal system; nervous system; pancreas; respiratory 
system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract The number 
of libraries in each category is counted and divided by the total number of libraries across all 
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categories. Similarly, each human tissue is classified into one of the following disease/condition 
categories: cancer, cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, 
and other, and the number of libraries in each category is counted and divided by die total number of 
libraries across all categories. The resulting percentages reflect the tissue- and disease-specific 

5 expression of cDNA encoding SECT. cDNA sequences and cDNA library/tissue information are 
found in the LEFESEQ GOLD database (Incyte Genomics, Palo Alto CA). 
VIII. Extension of SECP Encoding Polynucleotides 

Full length polynucleotides are produced by extension of an appropriate fragment of the full 
length molecule using oligonucleotide primers designed from this fragment One primer was 

10 synthesized to initiate 5* extension of the known fragment, and the other primer was synthesized to 
initiate 3* extension of the known fragment The initial primers were designed using OLIGO 4.06 
software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 

IS structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PGR using methods well known in the art PGR. 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 

20 mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH^CX,, 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Biosciences), ELONGASE enzyme 
(Invitrogen), and Pfu DNA polymerase (Stratagene), with the following parameters for primer pair 
PQ A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 2 
min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the 

25 alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57 °C, 1 min; Step 4: 68°C, 2 min, Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C 

The concentration of DNA in each well was determined by dispensing 100 fil PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 

30 and 05 jil of undiluted PGR product into each well of an opaque fhiorimeter plate (Corning Costar, 
Acton MA), allowing die DNA to bind to the reagent The plate was scanned in a Fhioroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
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concentration of DNA. A 5 jd to 10 §A aliquot of the reaction mixture was analyzed by 
electrophoresis on a 1 % agarose gel to determine which reactions were successful in extending the 
sequence. 

Hie extended nucleotides were desalted and concentrated, transferred to 3 84- well plates, 
5 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Biosciences). For shotgun 
sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose gels, 
fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
10 Biosciences), treated with Pfti DNA polymerase (Stratagene) to fill-in restriction site overhangs, and 
transfected into competent K coli cells* Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 3 84- well plates in LB/2x 
carb liquid media. ' 

The cells were lysed, and DNA was amplified by PGR using Taq DNA polymerase 
15 (Amersham Biosciences) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 
1: 94°C, 3 min; Step 2: 94 °C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72 °C, 2 min; Step 5: steps 2, 3, and 
4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by 
PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA recoveries 
were reamplified using the same conditions as described above. Samples were diluted with 20% 
20 dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing primers 
and die DYENAMIC DIRECT kit (Amersham Biosciences) or the ABI PRISM B1GDYE 
Terminator cycle sequencing ready reaction kit (Applied Biosystems). 

In like manner, full length polynucleotides are verified using die above procedure or are used 
to obtain 5* regulatory sequences using the above procedure along with oligonucleotides designed for 
25 such extension, and an appropriate genomic library. 

IX. Identification of Single Nucleotide Polymorphisms in SECP Encoding 
Polynucleotides 

Common DNA sequence variants known as single nucleotide polymorphisms (SNPs) were 
identified in SEQ ID NO:52-102 using die LBFESEQ database (Incyte Genomics). Sequences from 
30 the same gene were clustered together and assembled as described in Example m, allowing the 
identification of all sequence variants in die gene. An algorithm consisting of a series of filters was 
used to distinguish SNPs from other sequence variants. Preliminary filters removed the majority of 
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basecall errors by requiring a minimum Fhred quality score of 15, and removed sequence alignment 
errors and errors resulting from improper trimming of vector sequences, chimeras, and splice variants. 
An automated procedure of advanced chromosome analysis analysed the original chromatogram files 
in the vicinity of the putative SNP. Clone error filters used statistically generated algorithms to identify 
5 errors introduced during laboratory processing, such as those caused by reverse transcriptase, 
polymerase, or somatic mutation. Clustering error filters used statistically generated algorithms to 
identify errors resulting from clustering of close homobgs or pseudogenes, or due to contamination by 
non-human sequences. A final set of filters removed duplicates and SNPs found in immunoglobulins 
or T-cell receptors. 

10 Certain SNPs were selected for further characterization by mass spectrometry using the high 

throughput MASSARRAY system (Sequenom, Inc.) to analyze allele frequencies at the SNP sites in 
four different human populations. The Caucasian population comprised 92 individuals (46 male, 46 
female), including 83 from Utah, four French, three Venezualan, and two Amish individuals. The 
African population comprised 194 individuals (97 male, 97 female), all African Americans. The 

15 Hispanic population comprised 324 individuals (162 male, 162 female), all Mexican Hispanic. The 
Asian population comprised 126 individuals (64 male, 62 female) with a reported parental breakdown 
of 43% Chinese, 31% Japanese, 13% Korean, 5% Vietnamese, and 8% other Asian. Allele 
frequencies were first analyzed in the Caucasian population; in some cases those SNPs which showed 
no allelic variance in this population were not further tested in the other three populations. 

20 X. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO.52-102 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially die same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 

25 software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ^Ci of 

[y- 37 ?] adenosine triphosphate (Amersham Biosciences), and T4 polynucleotide kinase (DuPont NEN, 
Boston MA). Hie labeled oligonucleotides are substantially purified using a SEPHADEX G-25 
superfine size exclusion dextran bead column (Amersham Biosciences). An aliquot containing 10 7 
counts per minute of the labeled probe is used in a typical membrane-based hybridization analysis of 

30 human genomic DNA digested with one of the following endonucleases: Ase I, Bgl n, Eco RI, Pst I, 
Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
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membranes (Nytran Phis, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
5 compared. 

XI. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing; see, e.g., Baldeschweiler et aL, supra), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 

10 aforementioned technologies should be uniform and solid with a non-porous surface (Schena, M., ed. 
(1999) DNA Microarrays: A Practical Approach. Oxford University Press, London). Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 

15 produced using available methods and machines well known to those of ordinary skill in the art and 
may contain any appropriate number of elements (Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et aL (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat Biotechnol. 
16:27-31). 

Pull length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
20 comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The 
array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
25 fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
die microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 
30 Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using die oligo-(dT) cellulose method. Each poly(A)* RNA sample is 
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reverse transcribed using MMLV reverse-tninscriptase, 0.05 pg//il oHgo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units//*] RNase inhibitor, 500 j*M dATP, 500 fiM dGTP, 500 uM dTTP, 40 uM 
dCTP, 40 /iM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Biosciences). The reverse transcription 
reaction is performed in a 25 ml volume containing 200 ng poly(A)* RNA with GEMBRIGHT kits 

5 (Incyte Genomics). Specific control poly(A)* RNAs are synthesized by in vitro transcription from 
non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one with 
Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and incubated for 
20 minutes at 85°C to the stop the reaction and degrade die RNA. Samples are purified using two 
successive CHROMA SPIN 30 gel filtration spin columns (Clontech, Palo Alto CA) and after 

10 combining, both reaction samples are ethanol precipitated using 1 ml of glycogen (1 mg/ml), 60 ml 
sodium acetate, and 300 ml of 100% ethanol The sample is then dried to completion using a 
SpeedVAC (Savant Instruments Inc., Holbrook NY) and resuspended in 14 /il 5X SSO0.2% SDS. 
Microarrav Preparation 

Sequences of the present invention are used to generate array elements. Each array element 

IS is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to die vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 /ig. 
Amplified array elements are then purified using SEPHACRYL-400 (Amersham Biosciences). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

20 slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma- Aldrich, St Louis MO) in 95% ethanol Coated slides 
are cured in a 110°C oven. 

25 Array elements are applied to the coated glass substrate using a procedure described in U.S. 

Patent No. 5,807,522, incorporated herein by reference. 1 /il of the array element DNA, at an average 
concentration of 100 ng//xl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crossfinked using a STRATALJNKER UV-crosslinker (Stratagene). 

30 Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 0.2% 
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SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 p\ of sample mixture consisting of 0.2 fig each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 

5 mixture is heated to 65° C for 5 minutes and is aliquoted onto the micro array surface and covered with 
an 1.8 cm 2 coversHp. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 140 
fd of SX SSC in a corner of the chamber. The chamber containing the arrays is incubated for about 
6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a first wash buffer (IX SSC, 0.1% 

10 SDS), three times for 10 minutes each at 45° C in a second wash buffer (0. IX SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 

15 focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 

20 Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between die array and die photomultiplier tubes are used to filter the signals. Hie 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 

25 although the apparatus is capable of recording die spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that location 
to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples from 

30 different sources (e.g., representing test and control cells), each labeled with a different fluorophore, 
are hybridized to a single array for the purpose of identifying genes that are differentially expressed, 
the calibration is done by labeling samples of the calibrating cDNA with the two fluorophores and 



103 



WO 03/068943 



PCT/US03/04712 



adding identical amounts of each to the hybridization mixture. 

The output of die photomultiplier tube is digitized using a 12-bit RT1-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 

5 linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
signal). Hie data is also analyzed quantitatively. Where two different fhiorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping emission 
spectra) between the fhiorophores using each fhiorophores emission spectrum. 

A grid is superimposed over die fluorescence signal image such that die signal from each spot 

10 is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to die average intensity of die signal. The software used 
for signal analysis is die GEMTOOLS gene expression analysis program (Incyte Genomics). Array 
elements that exhibit at least about a two-fold change in expression, a signal-to-background ratio of at 
least about 2.5, and an element spot size of at feast about 40%, are considered to be differentially 

15 expressed. 
Expression 

For example, SEQ ID NO:62, SEQ ID NO:74, and SEQ ID NO:75 showed increased 
expression in Tangier disease-derived fibroblasts compared to normal fibroblasts. In addition, both 
types of cells were cultured in die presence of cholesterol and compared with the same cell type 

20 cultured in the absence of cholesterol Human fibroblasts were obtained from skin explants from both 
normal subjects and two patients with homozygous Tangier disease. Cell lines were immortalized by 
transfection with human papillomavirus 16 genes E6 and E7 and a neomycin resistance selectable 
marker. TD-derived cells are deficient in an assay of apoA-I mediated tritiated cholesterol efflux. 
Therefore, SEQ ID NO:62, SEQ ID NO:74, and SEQ ID NO:75 are useful in diagnostic assays for 

25 Tangier disease. 

As another example, SEQ ID NO:62 showed decreased expression in lung tissue affected by 
adenocarcinoma versus uninvolved lung tissue, as determined by micro array analysis. Moderately 
differentiated adenocarcinoma of the right hing was compared to grossly uninvolved lung tissue from a 
60 year-old donor (Huntsman Cancer Institute). Therefore, SEQ ID NO:62 is useful in monitoring 
30 treatment of, and diagnostic assays for, lung adenocarcinoma and other cell proliferative disorders. 

As another example, SEQ ID NO:65 showed increased expression in colon tissue affected by 
adenocarcinoma versus uninvolved colon tissue, as determined by micro array analysis. Gene 
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expression profiles were obtained by comparing normal colon tissue from a 60-year-old donor to colon 
adenocarcinoma tumor tissue from the same donor (Huntsman Cancer Institute, Salt Lake City, UT). 
Therefore, SEQ ID NO:65 is useful in monitoring treatment of, and diagnostic assays for, colon 
adenocarcinoma and other cell proliferative disorders. 

5 As another example, SEQ ID NO:69 showed increased expression in breast carcinoma cells 

versus a nonmalignant mammary epithelial cell line. The gene expression profile of a nonmalignant 
mammary epithelial cell line was compared to the gene expression profiles of breast carcinoma lines at 
different stages of tumor progression. CeD lines compared included: a) BT-20, a breast carcinoma cell 
line derived in vitro from the cells emigrating out of thin slices of tumor mass isolated from a 74-year- 

10 old female, b) BT-474, a breast ductal carcinoma cell line that was isolated from a solid, invasive 
ductal carcinoma of the breast obtained from a 60-year-old woman, c) BT-483, a breast ductal 
carcinoma cell line that was isolated from a papillary invasive ductal tumor obtained from a 23-year- 
old normal, menstruating, parous female with a family history of breast cancer, d) Hs 57 8T, a breast 
ductal carcinoma cell line isolated from a 74-year-old female with breast carcinoma, e) MCF7, a 

15 nonmalignant breast adenocarcinoma cell line isolated from die pleural effusion of a 69-year-old 

female, f) MCF-10A, a breast mammary gland (luminal ductal characteristics) cell line isolated, from a 
36-year-old woman with fibrocystic breast disease, g) MDA-MB-468, a breast adenocarcinoma cell 
line isolated from the pleural effusion of a 51-year-old female with metastatic adenocarcinoma of the 
breast, and h) HMEC, a primary breast epithelial cell line isolated from a normal donor. Therefore, 

20 SEQ ID NO:69 is useful in monitoring treatment of, and diagnostic assays for, breast carcinoma and 
other cell proliferative disorders. 

As another example, SEQ ID NO:70 showed decreased expression in C3 A cells treated with 
gemfibrozil versus untreated cells. Early confluent C3A cells were treated with various amounts of 
Gemfibrozil (120, 600, 800, and 1200 jig/ml) dissolved CMC, for 1, 3, and 6 hours. Parallel samples of 

25 C3A cells were treated with 1% CMC only, as a control Therefore, SEQ ID NO:70 is useful in 
monitoring treatment of, and diagnostic assays for, coronary heart disease and other 
autoimmune/inflammatory disorders. 

As another example, SEQ ID NO: 80 showed differential expression in cancerous tissues 
versus normal tissues as determined by microarray analysis. In one experiment, the expression of 

30 SEQ ID NO:80, as determined by microarray analysis, was decreased by at least two fold in breast 
tumor tissues relative to normal breast tissues. The breast tumor tissues were harvested from a 43 
year old female donor diagnosed with invasive lobular carcinoma. The tumor is well differentiated and 
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metastatic. The normal breast tissues were harvested from grossly uninvolved breast tissue of the 
same donor. Therefore, SEQ ID NO:80 is useful as a diagnostic marker or as a potential therapeutic 
target for breast cancer. 

In the second experiment, the expression of SEQ ID NO:80, as determined by micro array 

S analysis, was increased by at least two fold in sigmoid colon tissues relative to normal sigmoid colon 
tissues. The sigmoid colon tumor tissue which originated from a metastatic gastric sarcoma (stromal 
tumor) was harvested from a 48 year old female donor. The normal sigmoid colon tissue was 
harvested from grossly uninvolved sigmoid colon tissue of the same donor. Therefore, SEQ ID NO:80 
is useful as a diagnostic marker or as a potential therapeutic target for colon cancer. 

10 In the third experiment, SEQ ID NO.80 also showed decreased expression in tissue affected 

by hing adenocarcinoma versus normal lung tissue as determined by micro array analysis. A sample of 
right lung tissue that showed moderately differentiated adenocarcinoma was compared to grossly 
uninvolved hing tissue from a 60 year old donor (Huntsman Cancer Institute, Salt Lake City, UT). 
Therefore, SEQ ID NO:80 is useful in diagnostic assays for and monitoring treatment of lung cancer. 

15 In the fourth experiment, the expression of SEQ ID NO:80 was decreased by at least two fold 

in ovarian adenocarcinoma tissues relative to grossly uninvolved normal ovarian tissue from a 79 year 
old donor. Therefore, SEQ ID NO:80 is useful as a diagnostic marker or as a potential therapeutic 
target for ovarian cancer. 

The four experiments described above showed that SEQ ID NO:80 is useful as a diagnostic 

20 marker or as a potential therapeutic target for breast, colon, lung, and ovarian cancers. 

In an alternative example, die expression of SEQ ID NO:80 were increased by at least two- 
fold in treated human adipocytes from obese donors when compared to non-treated adipocytes from 
the same donors. The obese human primary subcutaneous preadipocytes were isolated from adipose 
tissue of a 36-year-old healthy female with a body mass index (BMI) of 27.7 (overweight but 

25 otherwise healthy). The preadipocytes were cultured and induced to differentiate into adipocytes by 
culturing them in die differentiation medium containing die active components, PPAR-y agonist and 
human insulin. Human preadipocytes were treated with human insulin and PPAR-y agonist for three 
days and subsequently were switched to medium containing insulin for 5, 9, and 12 more days before 
the cells were collected for analysis. Differentiated adipocytes were compared to untreated 

30 preadipocytes maintained in culture in die absence of inducing agents. An overall differentiation rate 
of more than 60% was observed after 15 days in culture. The experiment show that SEQ ID NO: 80 
is useful for the diagnosis, prognosis, or treatment of diabetes mellitus and other disorders, such as 
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obesity, hypertension, and atherosclerosis. 

In another example, SEQ ID NO:81 showed differential expression in inflammatory responses 
as determined by microarray analysis. The expression of SEQ ID NO: 81 was increased by at least 
two fold in peripheral blood mononuclear cells (PBMCs, 12% B lymphocytes, 40% T lymphocytes, 
5 20% NK cells, 25% monocytes, and 3% various cells that include dendritic and progenitor cells) 

treated with 10 ng/tnl recombinant interleukin 5 (IL-5) for 2 hours relative to untreated cells. EL-5 is a 
T cell-derived factor that promotes the proliferation, differentiation, and activation of eosinophils. IL-5 
exerts its activity on target ceDs by binding to specific cell surface receptors. The functional high- 
affinity receptor for human IL-5 is composed of a low-affinity IL-5 binding a-subunit and a non- 
10 binding common b-subunit that is shared with the high-affinity receptors for GM-CSF and EL-3. 

Therefore, this experiment showed that SEQ ID NO:81 is useful in diagnostic assays for inflammatory 
responses. 

In addition, the expression of SEQ ID NO:81 is up-regulated in immature dendritic cells 
relative to monocytes by at least three-fold. Hie types of cDNAs that are up-regulated in concert 

15 during the transition from monocyte to dendritic cell reflect DC's newly acquired functions, such as 
antigen uptake. This experiment also showed that SEQ ID NO: 81 is useful in diagnostic assays for 
inflammatory responses. 

Further, as determined by microarray analysis, SEQ ID NO:81 showed differential expression 
in BT483 breast carcinoma cell line versus HMEC primary mammary epithelial cells. BT483 is a 

20 breast ductal carcinoma cell line isolated from a papillary invasive ductal tumor from a 23-year-old 
normal, menstruating, parous female. HMEC, a primary mammary epithelial cell line was derived 
from normal human mammary tissue (Clonetics, San Diego, CA). The microarray experiments 
showed that die expression of SEQ ID NO.81 was increased by at least two fold in BT483 breast 
ductal carcinoma line grown either in die presence or the absence of growth factors or nutrients 

25 relative to HMEC primary mammary epithelial cells grown in die absence of growth factors and 
nutrients. Therefore, SEQ ID NO:81 is useful as diagnostic markers or as potential therapeutic 
targets for breast cancer. 

SEQ ID NO:81 showed differential expression in prostate carcinoma cell lines versus normal 
prostate epithelial cells as determined by microarray analysis. Three prostate carcinoma cell lines, DU 

30 145, LNCaP, and PC-3 were included in the experiments. DU 145 was isolated from a metastatic site 
in the brain of a 69 year old male with widespread metastatic prostate carcinoma. DU 145 has no 
detectable sensitivity to hormones; forms colonies in semi-solid medium; is only weekly positive for 
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acid phosphatase; and cells are negative for prostate specific antigen (PSA). LNCaP is a prostate 
carcinoma cell line isolated from a lymph node biopsy of a SO year old male with metastatic prostate 
carcinoma. LNCaP expresses PSA, produces prostate acid phosphatase, and expresses androgen 
receptors. PCX), a prostate adenocarcinoma cell line, was isolated from a metastatic site in the bone 

5 of a 62 year old male with grade IV prostate adenocarcinoma. The normal epithelial cell line, PrEC, is 
a primary prostate epithelial cell line isolated from a normal donor. In one experiment, the expression 
of cDNAs from the prostate carcinoma cell lines were compared to that of the normal prostate 
epithelial ceDs grown under the same conditions (in the absence of growth factors and hormones). 
This experiment showed that the expression of SEQ ID NO:81 was increased by at feast two fold in 

10 both DU14S and LNCaP prostate carcinoma lines relative to PrECs. In the other experiment, the 
expression of cDNAs from the prostate carcinoma cell fines grown in optimal conditions (in the 
presence of growth factors and hormones) were compared to that of the normal prostate epithelial 
ceDs grown under restrictive conditions (in the absence of growth factors and hormones). The 
experiment showed that the expression of SEQ ID NO:8 1 was increased by at least two fold in 

15 DU145, LNCaP, and PC-3 prostate carcinoma lines relative to PrECs. Therefore, SEQ ID NO:81 is 
useful as a diagnostic marker or as a potential therapeutic target for prostate cancers. 

In yet another example, SEQ ID NO:82 showed differential expression, as determined by 
microarray analysis, in liver C3 A cells treated with insulin and LY294002. The human C3A cell line is 
a clonal derivative of HepG2/C3 (hepatoma cell line, isolated from a 15-year-old male with liver 

20 tumor), which was selected for strong contact inhibition of growth. The use of a clonal population 
enhances the reproducibility of die cells. C3 A cells have many characteristics of primary human 
hepatocytes in culture: i) expression of insulin receptor and insulin-like growth factor II receptor; ii) 
secretion of a high ratio of serum albumin compared with a-fetoprotein iii) conversion of ammonia to 
urea and glutamine; iv) metabolism of aromatic amino acids; and v) proliferation in glucose-free and 

25 insulin-free medium. The C3A cell line is now well established as an in vitro model of the mature 
human liver (Mickelson et aL (1995) Hepatology 22:866-875; Nagendra et aL (1997) Am J Physiol 
272:G408-G416). LY294002 is a PD kinase specific inhibitor that promotes cell cycle arrest of C3A 
cells and skews these cells towards a more 'liver-like' state. This factor also appears to enhance the 
metabolic activity of die cells, especially with respect to proteins such as P450. In this experiment, 

30 C3A cells were starved of insulin for 3 days (-3 d to 0 h) and then further cultured in the presence of 
both insulin and die PD-K inhibitor LY924002 for 0 hour, 24 hours, 2 hours and 3 days. The result 
showed that the expression of SEQ ID NO:82 was increased by at least a two-fold at in expression at 



108 



WO 03/068943 



PCT/US03/04712 



three out of the four time points (0 hour, 24 hours, and 2 hours). This experiment indicates that SEQ 
ID NO:82 is useful diagnostic assays for liver diseases and as a potential biological marker and 
therapeutic agent in the treatment of liver diseases and disorders. 

In an alternative example, the expression of SEQ ID NO:85 was decreased by at least two 

5 fold in peripheral blood mononuclear cells (PBMCs, 12% B lymphocytes, 40% T lymphocytes, 20% 
NK cells, 25% monocytes, and 3% various cells that include dendritic and progenitor cells) treated 
with 10 ng/ml recombinant interleukin 4 (IL-4) for 2 hours relative to untreated cells. IL-4 is a 
pleiotropic cytokine produced by activated T cells, mast cells, and basophils. It was initially identified 
as a B cell differentiation factor (BCDF) and a B cell stimulatory factor (BSF1). IL-4 exhibits anti- 

10 tumor effects both in vivo and in vitro. Recently, IL-4 was identified as an important regulator for the 
CD4+ subset (Thl-Hke vs. Th2-Kke) development The biological effects of IL4 are mediated by the 
binding of IL-4 to specific cell surface receptors. Hie functional high-affinity receptor for IL-4 
consists of a hgand-binding subunit (DL-4 R) and a second subunit (b chain) that can modulate the 
ligand binding affinity of the receptor complex. In certain cell types, the gamma chain of die DL-2 

IS receptor complex is a functional b chain of the IL-4 receptor complex. This experiment showed that 
SEQ ID NO:85 is useful in diagnostic assays for inflammatory responses. 

In another example, SEQ ID NO:87 showed differential expression in inflammatory responses 
as determined by microarray analysis. In one experiment, Raji B lymphoblast cells were stimulated in 
vitro with 0.1 mM soluble PMA (phorbol 12-myristate 13-acetate) and 1 mg/ml ionomycin for 0.5, 1, 

20 2, 4, and 8 hours. Treated cells were compared to untreated Raji cells kept in culture in the absence 
of stimuli. Raji is a B lymphoblast cell line (Burkitt's lymphoma) that was isolated from the left jaw of 
an 1 1-year-old male. This cell line tests positive for the Epstein-Barr Nuclear Antigen (EBNA) but 
does not cany any detectable virus particles. Raji has been extensively used to study signaling in 
human B cells, identify factors produced by human B cells, and study the anti -lymphoma immune 

25 response. PMA is a broad activator of die protein kinase G-dependent pathways. Ionomycin is a 

calcium ionophore that permits the entry of calcium in die cell, hence increasing the cytosolic calcium 
concentration. The combination of PMA and ionomycin activates two of the major signaling pathways 
used by mammalian cells to interact with their environment In T cells, the combination of PMA and 
ionomycin mimics the type of secondary signaling events elicited during optimal B cell activation. This 

30 experiment showed that expression of SEQ ID NO:87 was increased by at least two fold in three out 
of five time points (1, 2, and 4 hours). 

In another experiment, the expression of SEQ ID NO:87 was increased by at feast two fold in 
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peripheral blood mononuclear cells (PBMCs, 12% B lymphocytes, 40% T lymphocytes, 20% NK 
cells, 25% monocytes, and 3% various ceDs that include dendritic and progenitor cells) treated with 10 
ng/ml tumor necrosis factor alpha (TNF-a ) for 2 hours relative to untreated cells. TNF-a is produced 
by neutrophils, activated lymphocytes, macrophages, NK cells, LAK cells, astrocytes, endothelial cells, 

5 smooth muscle cells, and some transformed cells. TNF-a occurs as a secreted, soluble form and a 
membrane-anchored form, both of which are biologically active. TNF-a plays a critical role in 
mediation of die inflammatory response and in mediation of resistance to infections and tumor growth. 

In the third experiment, the expression of SEQ ID NO:87 was increased by at least two fold 
in peripheral blood mononuclear cells (PBMCs, 12% B lymphocytes, 40% T lymphocytes, 20% NK 

10 cells, 25% monocytes, and 3% various ceDs that include dendritic and progenitor cells) treated with 
lipopolysaccharide (LPS) for 2 or 4 hours relative to untreated ceDs. LPS elicits a variety of 
inflammatory responses, and because it activates complement by die alternative (properdin) pathway, 
it is often part of the pathology of gram-negative bacterial infections. It is thought that LPS released 
into the bloodstream by lysing gram-negative bacteria is first bound by certain plasma proteins 

15 • identified as LPS-binding proteins. The LPS-binding protein complex interacts with CD14 receptors 
on monocytes, macrophages, B ceDs, and other types of receptors on endothelial cells. Activation of 
human B ceDs with LPS results in mitogenesis as weD as immunoglobulin synthesis. In monocytes and 
macrophages three types of events are triggered during their interaction with LPS: 1) Production of 
cytokines, including IL-1, 11^6, IL-8, TNF-a, and platelet-activating factor. These in turn stimulate 

20 production of prostaglandins and leukotrienes, powerful mediators of inflammation and septic shock 
that accompany endotoxin toxemia. 2) Activation of the complement cascade. 3) Activation of the 
coagulation cascade. 

Therefore, the three experiments described above showed that SEQ ID NO:87 is useful in 
diagnostic assays for inflammatory responses. 

25 As a further example, the expression of SEQ ID NO.101 was increased by at least two fold in 

human umbilical vein endothelial ceDs (HUVECs) treated with 100 nM phorbol 12-myri state 
13-acetate (PMA) and 10 ng/mltumor necrosis factor-alpha (TNF-a) for 24 hours relative to untreated 
HUVECs. Human umbilical vein endothelial ceDs are primary cells derived from the endothelium of 
the human umbilical vein and have been used as an experimental model for investigating the role of the 

30 endothelium in human vascular biology. PMA is an agonist of protein kinase C (PKC) which is a 
calcium-activated, phospho-lipid-dependent serine- and threonine-specific kinase that upon activation 
phosphorylates a broad range of secondary targets. TNF-a is a pleiotropic cytokine that plays a 
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central role in mediating the inflammatory response through activation of multiple signal transduction 
pathways. TNF-a is produced by activated lymphocytes, macrophages, and other white blood cells 
and can activate endothelial cells. Monitoring die endothelial cells* response to TNF-a at the level of 
mRNA expression can provide information necessary for better understanding of both TNF-a signaling 

5 pathways and endothelial cell biology. TNF-a is also known to cause translocation of PKC from the 
cytosol to the membrane where it phosphorylates a variety of targets. Inhibition of PKC in die TNF-a 
activation experimental model will help clarify the PKC-dependent events during TNF-a signaling. 
The experiment indicated that SEQ ID NO.101 is useful in diagnosis, prognosis, or treatment of 
inflammatory disorders and endotheHal-related disorders, such as those delected in vascular tone 

10 regulation, coagulation, thrombosis, and atherosclerosis. 
XII. Complementary Polynucleotides 

Sequences complementary to the SECP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring SECP. Although use of oligonucleotides 
comprising from about IS to 30 base pairs is described, essentially the same procedure is used with 

15 smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of SECP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to die coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the SECP-encoding transcript 

20 XIII. Expression of SECP 

Expression and purification of SECP is achieved using bacterial or virus-based expression 
systems. For expression of SECP in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, die trp-lac (tac) hybrid 

25 promoter and the TS or T7 bacteriophage promoter in conjunction with die lac operator regulatory 
element Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express SECP upon induction with isopropyl beta-D-thiogalactopyranoside 
(IPTG). Expression of SECP in cukaryotic cells is achieved by infecting insect or mammalian cell 
lines with recombinant Auto graphic a californica nuclear polyhedrosis virus (AcMNPV), commonly 

30 known as baculovirus. The nonessential polyhedrin gene of baculovirus is replaced with cDNA 
encoding SECP by either homologous recombination or bacterial-mediated transposition involving 
transfer plasmid intermediates. Viral infectivity is maintained and the strong polyhedrin promoter 
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drives high levels of cDNA transcription. Recombinant baculo virus is used to infect Spodoptera 
frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. Infection of the 
latter requires additional genetic modifications to baculo virus (Engelhard, E.K. et aL (1994) Proc. NatL 
Acad. ScL USA 913224-3227; Sandig, V. et aL (1996) Hum. Gene Ther. 7:1937-1945). 

5 In most expression systems, SECP is synthesized as a fusion protein with, e.g., glutathione S- 

transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma japonicwn, enables die purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Biosciences). 

10 Following purification, die GST moiety can be proteolyticaDy cleaved from SECP at specifically 
engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinhy purification using 
commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a 
stretch of six consecutive histidine residues, enables purification on metal-chelate resins (Q1AGEN). 
Methods for protein expression and purification are discussed in Ausubel et aL (supra* ch. 10 and 16). 

IS Purified SECP obtained by these methods can be used directly in die assays shown in Examples XVII, 
XVffl, XDC, and XX, where applicable. 
XIV. Functional Assays 

SECP function is assessed by expressing the sequences encoding SECP at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 

20 vector containing a strong promoter that drives high levels of cDN A expression. Vectors of choice 
include PCMV SPORT plasmid (Invitrogen, Carlsbad CA) and PCR3.1 plasmid (Invhrogen), both of 
which contain the cytomegalovirus promoter. 5-10 /jg of recombinant vector are transiently 
transfected into a human cell line, for example, an endothelial or hematopoietic cell line, using either 
liposome formulations or ekctroporanon. 1-2 pg of an additional plasmid containing sequences 

25 encoding a marker protein are co-transfected. Expression of a marker protein provides a means to 
distinguish transfected cells from nontransfected cells and is a reliable predictor of cDNA expression 
from the recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein 
(GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser 
optics-based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to 

30 evaluate the apoptotic state of die ceDs and other cellular properties. FCM detects and quantifies the 
uptake of fluorescent molecules that diagnose events preceding or coincident with cell death. These 
events include changes in nuclear DNA content as measured by staining of DN A with propidium 
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iodide; changes in cell size and granularity as measured by forward light scatter and 90 degree side 
light scatter, down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine 
uptake; alterations in expression of cell surface and intracellular proteins as measured by reactivity 
with specific antibodies; and alterations in plasma membrane composition as measured by the binding 

5 of fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994; Flow Cytometry , Oxford, New York NY). 

The influence of SEGP on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding SECP and either CD64 or CD64-GFP. CD64 and 
CD64-GFP are expressed on die surface of transfected cells and bind to conserved regions of human 

10 immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD 64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art 
Expression of mRNA encoding SECP and other genes of interest can be analyzed by northern 
analysis or micro array techniques. 

IS XV. Production of SECP Specific Antibodies 

SECP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods EnzymoL 182:488-495), or other purification techniques, is used to 
immunize animals (e.g., rabbits, mice, etc.) and to produce antibodies using standard protocols. 
Alternatively, the SECP amino acid sequence is analyzed using LASERGENE software 

20 (DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophiHc regions are well 
described in the art (Ausubel et aL, supra, ch. 1 1). 

Typically, oligopeptides of about IS residues in length are synthesized using an ABI 431 A 

25 peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 

Aldrich, St Louis MO) by reaction with N-maleimidobenzoyl-N-hydmxysuccinimide ester (MBS) to 
increase immunogenicity (Ausubel et aL, supra). Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freunds adjuvant Resulting antisera are tested for anti peptide and anti-SECP 
activity by, for example, binding die peptide or SECP to a substrate, blocking with 1% BSA, reacting 

30 with rabbit antisera, washing, aid reacting with radio-iodinated goat anti-rabbit IgG. 
XVI. Purification of Naturally Occurring SECP Using Specific Antibodies 

Naturally occurring or recombinant SECP is substantially purified by immunoaffinity 
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chromatography using antibodies specific for SECP. An itnmiinoaffinity column is constructed by 
covalently coupling anti-SECP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Biosciences). After the coupling, the resin is blocked and 
washed according to the manufacturer's instructions. 

S Media containing SECP are passed over the imrmiBo affinity column, and die column is 

washed under conditions that allow die preferential absorb ance of SECP (e.g., high ionic strength 
buffers in the presence of detergent). The column is ehited under conditions that disrupt 
antibody/SECP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as 
urea or tfaiocyanate ion), and SECP is collected. 

10 XVII. Identification of Molecules Which Interact with SECP 

SECP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter reagent 
(Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539). Candidate molecules previously 
arrayed in die wells of a multi-well plate are incubated with die labeled SECP, washed, and any wells 
with labeled SECP complex are assayed. Data obtained using different concentrations of SECP are 

15 used to calculate values for die number, affinity, and association of SECP with the candidate 
molecules. 

Alternatively, molecules interacting with SECP are analyzed using the yeast two-hybrid 
system as described in Helds, S. and O. Song (1989; Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

20 SECP may also be used in die PATHCALLING process (CuraGen Corp., New Haven CT) 

which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. 
Patent No. 6,057,101). 
XVHL Demonstration of SECP Activity 

25 An assay for growth stimulating or inhibiting activity of SECP measures the amount of DNA 

synthesis in Swiss mouse 3T3 cells (McKay, L and I. Leigh, eds. (1993) Growth Factors: A Practical 
A pproach . Oxford University Press, New York, NY). In this assay, varying amounts of SECP are 
added to quiescent 3T3 cultured cells in the presence of [ 3 HJthymidine, a radioactive DNA precursor. 
SECP for this assay can be obtained by recombinant means or from biochemical preparations. 

30 Incorporation of pHJthymidine into acid-precipitabfe DNA is measured over an appropriate time 
interval, and the amount incorporated is directly proportional to the amount of newly synthesized 
DNA. A linear dose-response curve over at least a hundred-fold SECP concentration range is 
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indicative of growth modulating activity. One unit of activity per milliliter is defined as the 
concentration of SECP producing a 50% response level, where 100% represents maximal 
incorporation of pHJthymidine into acid-precipitable DNA . 

Alternatively, an assay for SECP activity measures the stimulation or inhibition of 

5 neurotransmission in cultured cells. Cultured CHO fibroblasts are exposed to SECP. Following 
endocytic uptake of SECP, the cells are washed with fresh culture medium, and a whole cell voltage- 
clamped Xenopus myocyte is manipulated into contact with one of the fibroblasts in SECP-free 
medium. Membrane currents are recorded from the myocyte. Increased or decreased current 
relative to control values are indicative of neoromodulatory effects of SECP (Morimoto, T. et aL 

10 (1995) Neuron 15:689-696). 

Alternatively, an assay for SECP activity measures the amount of SECP in secretory, 
membrane-bound organelles. Transfected cells as described above are harvested and lysed. The 
lysate is fractionated using methods known to those of skill in die art, for example, sucrose gradient 
ultracentrifugation. Such methods allow die isolation of subcellular components such as die Golgi 

15 apparatus, ER, small membrane-bound vesicles, and other secretory organelles. Immunoprecipitations 
from fractionated and total cell lysates are performed using SECP-specific antibodies, and 
immunoprecipitated samples are analyzed using SDS-PAGE and immunoblotting techniques. The 
concentration of SECP in secretory organelles relative to SECP in total cell lysate is proportional to 
the amount of SECP in transit through die secretory pathway. 

20 Alternatively, AMP binding activity is measured by combining SECP with ^-labeled AMP. 

The reaction is incubated at 37°C and terminated by addition of trichloroacetic acid. The acid extract 
is neutralized and subjected to gel electrophoresis to remove unbound label. The radioactivity retained 
in the gel is proportional to SECP activity. 
XIX. Demonstration of Immunoglobulin Activity 

25 An assay for SECP activity measures the ability of SECP to recognize and precipitate 

antigens from serum. This activity can be measured by the quantitative precipitin reaction. (Gohib, 
E.S. et aL (1987) Immunology: A Synthesis. Sinauer Associates, Sunderland, MA, pp. 113-115.) 
SECP is isotopically labeled using methods known in die art Various serum concentrations are added 
to constant amounts of labeled SECP. SECP-antigen complexes precipitate out of solution and are 

30 collected by centrifugation. The amount of precipitable SECP-antigen complex is proportional to the 
amount of radioisotope detected in the precipitate. The amount of precipitable SECP-antigen complex 
is plotted against the serum concentration. For various serum concentrations, a characteristic 
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precipitin curve is obtained, in which the amount of precipitable SECP-antigen complex initially 
increases proportionately with increasing serum concentration, peaks at the equivalence point, and 
then decreases proportionately with further increases in semm concentration. Thus, the amount of 
precipitable SECP-antigen complex is a measure of SECP activity which is characterized by 

5 sensitivity to both limiting and excess quantities of antigen. 

Alternatively, an assay for SECP activity measures the expression of SECP on the cell 
surface. cDNA encoding SECP is transfected into a non-leukocytic cell line. Cell surface proteins 
are labeled with biotin (de la Fuente, M. A. et aL (1997) Blood 90:2398-2405). Immunoprecipitations 
are performed using SECP-specific antibodies, and immunoprecipitated samples are analyzed using 

10 SDS-PAGE and immunoblotting techniques. The ratio of labeled immunoprecipitant to unlabeled 
immunoprecipitant is proportional to die amount of SECP expressed on the cell surface. 

Alternatively, an assay for SECP activity measures the amount of cell aggregation induced by 
overexpression of SECP. In this assay, cultured cells such as NM3T3 are transfected with cDNA 
encoding SECP contained within a suitable mammalian expression vector under control of a strong 

15 promoter. Cotransfection with cDNA encoding a fluorescent marker protein, such as Green 

Fluorescent Protein (CLONTECH), is useful for identifying stable transfectants. The amount of cell 
agglutination, or clumping, associated with transfected cells is compared with that associated with 
untransfected cells. The amount of cell agglutination is a direct measure of SECP activity: 
XX. SECP Secretion Assay 

20 A high throughput assay may be used to identify polypeptides that are secreted in eukaryotic 

cells. In an example of such an assay, polypeptide expression libraries are constructed by fusing 5 - 
biased cDNAs to the 5 '-end of a leaderless ^lactamase gene, ^-lactamase is a convenient genetic 
reporter as it provides a high signal-to-noise ratio against low endogenous background activity and 
retains activity upon fusion to other proteins. A dual promoter system allows the expression of 0- 

25 lactamase fusion polypeptides in bacteria or eukaryotic cells, using the lac or CMV promoter, 
respectively. 

libraries are first transformed into bacteria, e.g., £ coli, to identify library members that 
encode fusion polypeptides capable of being secreted in a prokaryotic system. Mammalian signal 
sequences direct die translocation of ^-lactamase fusion polypeptides into die periplasm of bacteria 
30 where they confer antibiotic resistance to carbenicillin. Carbenicillin-selected bacteria are isolated on 
solid media, individual clones are grown in liquid media, and the resulting cultures are used to isolate 
library member plasmid DNA. 

116 



WO 03/068943 



PCT/US03/04712 



Mammalian cells, e.g., 293 cells, are seeded into 96-well tissue culture plates at a density of 
about 40,000 ceQsAveD in 100 jd phenol red-free DME supplemented with 10% fetal bovine serum 
(FBS) ( Life Technologies, RockviDe, MD). The following day, purified plasmid DNAs isolated from 
caibenicillin-resistant bacteria are diluted with 15 fil OFTI-MEM I medium (Life Technologies) to a 

5 volume of 25 fil for each well of cells to be transfected. In separate plates, 1 /il LF2000 Reagent (Life 
Technologies) is diluted into 25 /il/weD OFTI-MEM I. The 25 pi diluted LF2000 Reagent is then 
combined with the 25 /il diluted DNA, mixed briefly, and incubated for 20 minutes at room 
temperature. The resulting DNA-LF2000 reagent complexes are then added directly to each well of 
293 cells. Cells are also transfected with appropriate control plasmids expressing either wild-type P- 

10 lactamase, leaderless ^-lactamase, or, for example, CD4-fused leaderkss ^-lactamase. 24 his 

following transfection, about 90 pi of cell culture media are assayed at 37°C with 100 pM Nitrocefin 
(Calbiochem, San Diego, CA) and 0 J mM oleic acid (Sigma Corp. St Louis, MO) in 10 mM 
phosphate buffer (pH 7.0). Nitrocefin is a substrate for ^-lactamase mat undergoes a noticeable color 
change from yellow to red upon hydrolysis, ^-lactamase activity is monitored over 20 min in a 

15 microtiter plate reader at 486 nm. Increased color absorption at 486 nm corresponds to secretion of a 
P-lactamase fusion polypeptide in the transfected cell media, resulting from the presence of a 
eukaryotic signal sequence in the fusion polypeptide. Polynucleotide sequence analysis of the 
corresponding library member plasmid DNA is then used to identify the signal sequence-encoding 
cDNA. (Described in U.S. Patent application 09/803317, filed March 9, 2001.) 

20 For example, SEQ ID NO:28 was shown to be a secreted protein using this assay. 

Various modifications and variations of the described compositions, methods, and systems of 
the invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. It will be appreciated that the invention provides novel and useful proteins, and their 

25 encoding polynucleotides, which can be used in the drug discovery process, as well as methods for 
using these compositions for the detection, diagnosis, and treatment of diseases and conditions. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Nor should the description of such embodiments be considered exhaustive or limit the invention to the 

30 precise forms disclosed. Furthermore, elements from one embodiment can be readily recombined with 
elements from one or more other embodiments. Such combinations can form a number of 
embodiments within the scope of die invention. It is intended that the scope of the invention be 
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Incyte Full Length Clones 




5974456CA2, 6024316CA2. 951I8339CA2 




90122156CA2. 90122 188CA2 




901063 14CA2. 90106346CA2. 90106370CA2, 90106378CA2, 
90106422CA2. 90106470CA2, 90106494CA2, 90106495CA2 
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Signature Seauences. Domains and Motifs ! 
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Potential Phosphorylation Sites: S42 


Signal Peptide: Ml -SI 5 


Signal Peptide: M1-V18, M1-P20, M1-L16, M1-P20 
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Signal Peptide: M40-S54, M40-A58. M40-A60, M40-G62, M40-S61, M40-W55 


Potential Phosphorylation Sites: S61 S70 S80 S181 S240 S289 T190 T265 T344 Y206 | 


[Potential Glycosylation Sites: N238 N288 
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Potential Phosphorylation Sites: S29 S45 S46 T23 


I Potential Glycosylation Sites: N 1 8 
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I Potential Phosphorylation Sites: S87 


iPotential Glycosylation Sites: N78 
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Cytosolic domain: M1-T6 
Transmembrane domain: V7-A24 
Non-cytosolic domain: P25-N52 
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Potential Phosphorylation Sites: S21 S174 T46 T176 T203 


IPotential Glycosylation Sites: N220 
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Signal Peptide: M1-A18, M1-R20 


Cytosolic domain; Q24-Q47 
Transmembrane domain: T4-L23 
Non-cytosolic domain: M1-S3 
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ISisnal Peptide: M1-G21, M1-E24 
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Cytosolic domain: M1-R29 
Transmembrane domain: L30-V52 
Non-cytosolic domain: V53-R95 


iGuanylate cyclases signature: L35-R93 


[Potential Phosphorylation Sites: S9 S14 S27 S64 S80 
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Cytosolic domain: M1-V6 
Transmembrane domain: L7-S29 
Non-cytosolic domain: T30-D57 


IPotential Phosphorylation Sites: T30 T49 
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Signal Peptide: M1-S17, M1-C18, M1-A21, M1-H23, M1-A24, M1-L30 1 


| Potential Phosphorylation Sites: S28 S5 1 i 


IPotential Olycosylation Sites: N48 


(Signal Peptide: M1-A26 | 


[Signal Peptide: M1-S31, M9-S31 ! 


I Immunoglobulin domain: T97-L194 


I Ig superfamily from SCOP: T93-S 197 | 


Cytosolic domain: R232-G31 1 
Transmembrane domain: L209-T231 
Non-cytosolic domain: M1-P208 
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Signature Sequences, Domains and Motifs 


EGF 

DM00003|P98163|1373-1460: S229-V308, C277-E350, C143-E223 
DM00003|P35556|2219-2292: S274-C346 


EGF-LIKE DOMAIN ! 
DM00864|I55476|159-241: N234-C311, C277-D355, R146-E223 


SUSHI REPEAT 

DM04887IP16581I1-609: C459-A882 

Aspartic acid and asparagine hydroxylation site: C71-C82, C195-C206. C281-C292. C322- 


C333 

EGF-Hke domain signature 1: C435-C446, C478-C489, C521-C532, C566-C577, C653-C664, 
C697-C708, C740-C751, C784-C795, C827-C838, C870-C881, C913-C924, C956-C967, 
0999-01010,01042-01053 


EGF-like domain signature 2: C80-C95, C122-C137, C163-C178, C204-C219, C250-C265, 
C290-C303, C331-C346, C435-C446, C566-C577, C697-C708, C740-C751, C784-C795, 
C827-C838, C913-C924, C956-C967, C999-C1010, C1042-C1053 


Calcium-binding EGF-Iike domain pattern signature: D56-C80, D18O-C204, D267-C290, 
D307-C331 


Potential Phosphorylation Sites: S49 S201 S313 S347 S567 S1008 T264 T399 T401 T418 
T487T660 T836T953 


iPotential Glycosylation Sites: N147 N447 N458 N634 N769 N856 N867 N893 N1054 
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Signal Peptide: M1-A22, M1-V24, M1-Y29, M1-G26 
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Signature Sequences, Domains and Motifs 


PAROTID SECRETORY PROTEIN 
DM04779|P07743|12-234: G12-L186 
DM04779|B42337|12-235: G12-L186 


Leucine zipper pattern: L44-L65 


[Potential Phosphorylation Sites: S20 S184 T50 T95 


iPotential Glycosylation Sites: N124 N132 
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PEP-utilizing enzymes signatures: M1-V38 


PROLINERICH GLA PROTEIN 2 
PD059428: M1-G33 


IPotential Phosphorylation Sites: S21 T16 T26 
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! Trp-Asp (WD) repeat proteins proteins BL00678: S161-W171 


Trp-Asp (WD-40) repeats signature: C149-G198, D249-F278, V192-F236, T64-A1 12, T22- 
S79, S107-P154 


|Beta G-protein (transducin) signature PR00319: 1159-T173, P196-W213 


G-protein beta WD-40 repeat signature PR00320: 1159-T173 


PROTEIN REPEAT WD TRPASP REPEATS CONTAINING CHROMOSOME NUCLEAR 
FACTOR I PD000061: S203-L257, S96-W129 


1 Trp-Asp (WD) repeats signature: L33-F47, 1159-T173, 1201-V215 


Potential Phosphorylation Sites: S3 S125 S161 S349 S393 S420 T22 T83 T119 T167 T174 
T209 T251 T299 T3 10 T352 T378 T379 T383 T427 Y363 


Potential Glycosylation Sites: N181 
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SignaLcleavage: M1-A53 


[Signal Peptide: M35-A53 j 


Cytosolic domain: E54-F224 
Transmembrane domain: C31-A53 
Non-cytosolic domain: M1-P30 


(Potential Phosphorylation Sites: S9 S66 S168 T87 T104 T141 Y96 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from die group consisting 
ofSEQIDNO:l-51, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-5, SEQ ID NO:7, SEQ ID NO:10-11, SEQ ID NO:14, SEQ ID NO:17-20, SEQ 
ID NO:23-25, SEQ ID NOS7-38, SEQ ID NO:40-46, and SEQ ID NO:48-51 

c) a polypeptide comprising a naturally occurring amino acid sequence at least 92% 
identical to an amino acid sequence selected from the group consisting of SEQ ID 
NO:6, SEQ ID N039, and SEQ ID N047, 

d) a polypeptide comprising a naturally occurring amino acid sequence at least 93% 
identical to the amino acid sequence of SEQ ID NO:21, 

e) a polypeptide comprising a naturally occurring amino acid sequence at least 94% 
identical to the amino acid sequence of SEQ ID NO:9, 

f) a polypeptide comprising a naturally occurring amino acid sequence at least 98% 
identical to the amino acid sequence of SEQ ID NO: 13, 

g) a polypeptide consisting essentially of a naturally occurring amino acid sequence at 
least 90% identical to an amino acid sequence selected from the group consisting of 
SEQ ID NO:8, SEQ ID NO:12, SEQ ID NO:15, SEQ ID N022, SEQ ID NO:26, 

h) a biologically active fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ED NO:l-51, and 

i) an immunogenic fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-51. 

2. An isolated polypeptide of claim 1 comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-51. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 
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5. An isolated polynucleotide of claim 4 comprising a polynucleotide sequence selected from 
the group consisting of SEQ ID NO52-102. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method of producing a polypeptide of claim 1, the method comprising: 

a) cuhuring a cell under conditions suitable for expression of the polypeptide, wherein 
said cell is transformed with a recombinant polynucleotide, and said recombinant 
polynucleotide comprises a promoter sequence operably linked to a polynucleotide 
encoding the polypeptide of claim 1, and 

b) recovering the polypeptide so expressed. 

10. A method of claim 9, wherein the polypeptide comprises an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-51. 

11. An isolated antibody which specifically binds to a polypeptide of claim 1. 

12. An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from die group 
consisting of SEQ ID NO52-102, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at feast 
90% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:52-53, SEQ ID N059-61, SEQ ID NO:65-67, SEQ ID NO:69, SEQ ID 
NO:71, SEQ ID NO:73-75, SEQ ID NO:77-79, SEQ ID NO:85-86 f SEQ ID NO:89- 
92, SEQ ID NO.95-97, and SEQ ID NO:99-102, 

c) a polynucleotide comprising a naturally occurring polynucleotide sequence at feast 
91% identical to a polynucleotide sequence selected from die group consisting of SEQ 
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ID NO:62, and SEQ ID NO:88, 

d) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
93% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:54-56, SEQ ID NO:63, SEQ ID NO:83, and SEQ ID NO:93, 

e) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
94% identical to the polynucleotide sequence of SEQ ID NO:64, 

f) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
96% identical to a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:68, SEQ ID NO:81, and SEQ ID NO:87, 

g) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
98% identical to die polynucleotide sequence of SEQ ID NO:94, 

h) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 
99% identical to a polynucleotide sequence selected from the group consisting of SEQ 

. ID NO:57, SEQ ID NO:82, and SEQ ID NO:98, 

i) a polynucleotide consisting essentially of a naturally occurring polynucleotide 
sequence at least 90% identical to a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:58, SEQ ID NO:70, SEQ ID NO:72, SEQ ID NO:76, SEQ 
ID NO:80, and SEQ ID NO:84, 

j) a polynucleotide complementary to a polynucleotide of a), 
k) a polynucleotide complementary to a polynucleotide of b), 
1) a polynucleotide complementary to a polynucleotide of c), 
m) a polynucleotide complementary to a polynucleotide of d), 
n) a polynucleotide complementary to a polynucleotide of e), 
o) a polynucleotide complementary to a polynucleotide of f), 
p) a polynucleotide complementary to a polynucleotide of g), 
q) a polynucleotide complementary to a polynucleotide of h), 
r) a polynucleotide complementary to a polynucleotide of i), and 
s) an RNA equivalent of a>r). 

13. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 12. 
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14. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, the method comprising: 

a) hybridizing die sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and 
which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target 
polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, die amount thereof. 

15. A method of claim 14, wherein the probe comprises at least 60 contiguous nucleotides. 

16. A method of detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 12, die method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain 
reaction amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

17. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient 

18. A composition of claim 17, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO.1-51. 

19. A method for treating a disease or condition associated with decreased expression of 
functional SECP, comprising administering to a patient in need of such treatment die composition of 
claim 17. 

20. A method of screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, die method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 
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b) detecting agonist activity in the sample. 

21. A composition comprising an agonist compound identified by a method of claim 20 and a 
pharmaeeutically acceptable excipient 

22. A method for treating a disease or condition associated with decreased expression of 
functional SECP, comprising administering to a patient in need of such treatment a composition of 
claim 21. 

23. A method of screening a compound for effectiveness as an antagonist of a polypeptide of 
claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in fee sample. 

24. A composition comprising an antagonist compound identified by a method of claim 23 and 
a pharmaeeutically acceptable excipient 

25. A method for treating a disease or condition associated with overexpression of functional 
SECP, comprising administering to a patient in need of such treatment a composition of claim 24. 

26. A method of screening for a compound that specifically binds to fee polypeptide of claim 
1, fee method comprising: 

a) combining fee polypeptide of claim 1 wife at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to fee test compound, thereby 
identifying a compound that specifically binds to the polypeptide of claim 1. 

27. A method of screening for a compound feat modulates fee activity of fee polypeptide of 
claim 1, fee method comprising: 

a) combining fee polypeptide of claim 1 wife at least one test compound under conditions 
permissive for the activity of fee polypeptide of claim 1 , 

b) assessing fee activity of fee polypeptide of claim 1 in fee presence of fee test 
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compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test 

compound with the activity of the polypeptide of claim 1 in the absence of the test 
compound, wherein a change in the activity of the polypeptide of claim 1 in the 
presence of the test compound is indicative of a compound that modulates the activity 
of the polypeptide of claim 1 . 

28. A method of screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
comprising: 

a) exposing a sample comprising Ac target polynucleotide to a compound, under 
conditions suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

c) comparing the expression of the target polynucleotide in die presence of varying 
amounts of the compound and in die absence of the compound. 

29. A method of assessing toxicity of a test compound, the method comprising: 

a) treating a biological sample containing nucleic acids with die test compound, 

b) hybridizing the nucleic acids of the treated biological sample with a probe comprising 
at least 20 contiguous nucleotides of a polynucleotide of claim 12 under conditions 
whereby a specific hybridization complex is formed between said probe and a target 
polynucleotide in the biological sample, said target polynucleotide comprising a 
polynucleotide sequence of a polynucleotide of claim 12 or fragment thereof, 

c) quantifying the amount of hybridization complex, and 

d) comparing the amount of hybridization complex in the treated biological sample with 
the amount of hybridization complex in an untreated biological sample, wherein a 
difference in die amount of hybridization complex in the treated biological sample is 
indicative of toxicity of the test compound. 

30. A method for a diagnostic test for a condition or disease associated with the expression of 
SECP in a biological sample, the method comprising: 

a) combining the biological sample with an antibody of claim 1 1 , under conditions suitable 
for the antibody to bind the polypeptide and form an antibody:polypeptide complex, 
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and 

b) detecting the complex, wherein die presence of the complex correlates with die 
presence of the polypeptide in the biological sample. 

31. The antibody of claim 1 1 , wherein the antibody is: 

a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab*)2 fragment, or 

e) a humanized antibody. 

32. A composition comprising an antibody of claim 11 and an acceptable excipient 

*-. , 

33. A method of diagnosing a condition or disease associated with the expression of SECP in 
a subject, comprising administering to said subject an effective amount of the composition of claim 32. 

34. A composition of claim 32, further comprising a label. 

35. A method of diagnosing a condition or disease associated with the expression of SECP in 
a subject, comprising administering to said subject an effective amount of the composition of claim 34. 

36. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
11, die method comprising: 

a) immunizing an animal with a polypeptide consisting of an amino acid sequence 
selected from the group consisting of SEQ ID NOil-51, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibodies from die animal, and 

c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which specifically binds to a polypeptide comprising an amino acid sequence 
selected from die group consisting of SEQ ID NO: 1-51. 

37. A polyclonal antibody produced by a method of claim 36. 
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38. A composition comprising the polyclonal antibody of claim 37 and a suitable carrier. 

39. A method of making a monoclonal antibody with the specificity of the antibody of claim 
1 1 , the method comprising: 

5 a) immunizing an animal with a polypeptide consisting of an amino acid sequence 

selected from the group consisting of SEQ ID NO:l-51, or an immunogenic fragment 
thereof, under conditions to elicit an antibody response, 

b) isolating antibody producing cells from the animal, 

c) fusing the antibody producing cells with immortalized cells to form monoclonal 
1 0 antibody-producing hybridoma cells, 

d) cuhuring the hybridoma cells, and 

e) isolating from die culture monoclonal antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQIDNO:l-51. 

15 

40. A monoclonal antibody produced by a method of claim 39. 

41. A composition comprising the monoclonal antibody of claim 40 and a suitable carrier. 

20 42. Hie antibody of claim 11, wherein the antibody is produced by screening a Fab expression 

library. 

43. The antibody of claim 11, wherein die antibody is produced by screening a recombinant 
immunoglobulin library. 

25 

44. A method of detecting a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-51 in a sample, the method comprising: 

a) incubating die antibody of claim 1 1 with die sample under conditions to allow specific 
binding of die antibody and the polypeptide, and 
30 b) detecting specific binding, wherein specific binding indicates die presence of a 

polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:l-51 in the sample. 
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45. A method of purifying a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-51 from a sample, the method comprising: 

a) incubating the antibody of claim 1 1 with the sample under conditions to allow specific 
binding of the antibody and the polypeptide, and 
5 b) separating die antibody from the sample and obtaining the purified polypeptide 

comprising an amino add sequence selected from the group consisting of SEQ ID 
NO:l-51. 

46. A microarray wherein at least one element of the microarray is a polynucleotide of claim 

10 13. 

47. A method of generating an expression profile of a sample which contains polynucleotides, 
the method comprising: 

a) labeling the polynucleotides of the sample, 
15 b) contacting the elements of the microarray of claim 46 with the labeled polynucleotides 

of the sample under conditions suitable for the formation of a hybridization complex, 
and 

c) quantifying the expression of the polynucleotides in the sample. 

20 48. An array comprising different nucleotide molecules affixed in distinct physical locations 

on a solid substrate, wherein at least one of said nucleotide molecules comprises a first oligonucleotide 
or polynucleotide sequence specifically hybridizable with at least 30 contiguous nucleotides of a target 
polynucleotide, and wherein said target polynucleotide is a polynucleotide of claim 12. 

25 49. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 

completely complementary to at feast 30 contiguous nucleotides of said target polynucleotide. 

50. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to at least 60 contiguous nucleotides of said target polynucleotide. 

30 

51. An array of claim 48, wherein said first oligonucleotide or polynucleotide sequence is 
completely complementary to said target polynucleotide. 
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52. An array of claim 48, which is a microaiTay. 

53. An array of claim 48, further comprising said target polynucleotide hybridized to a 
nucleotide molecule comprising said first oligonucleotide or polynucleotide sequence. 

5 

54. An array of claim 48, wherein a linker joins at least one of said nucleotide molecules to 
said solid substrate. 

55. An array of claim 48, wherein each distinct physical location on the substrate contains 
10 multiple nucleotide molecules, and the multiple nucleotide molecules at any single distinct physical 

location have the same sequence, and each distinct physical location on die substrate contains 
nucleotide molecules having a sequence which differs from die sequence of nucleotide molecules at 
another distinct physical location on the substrate. 

15 56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

20 . 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 

60. A polypeptide of claim 1, comprising die amino acid sequence of SEQ ID NOl5. 
25 61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

62. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:7. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 8. 

30 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO.9. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:10. 
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66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 1 1. 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:12. 

68. A polypeptide of claim 1, comprising die amino acid sequence of SEQ ID NO:13. 

69. A polypeptide of claim 1, comprising die amino acid sequence of SEQ ID NO: 14. 

70. A polypeptide of claim 1 , comprising die amino add sequence of SEQ ID NO: 15. 

71. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 16. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l 8. 

74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

75. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:20. 

76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

77. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

78. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

79. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

80. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

81. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

82. A polypeptide of claim 1, comprising die amino acid sequence of SEQ ID NO:27. 
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83. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

84. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 

85. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

86. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO*3 1 . 

87. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID N032. 

88. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:33. 

89. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:34. 

90. A polypeptide of claim 1 /comprising the amino acid sequence of SEQ ID NO:35. 

91. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:36. 

92. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:37. 

93. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID N038. 

94. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO'39. 

95. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:40. 

96. A polypeptide of claim 1, comprising the ammo acid sequence of SEQ ID NO:41. 

97. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:42. 

98. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:43. 

99. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:44. 
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100. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:45. 

101. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:46. 
5 102. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:47. 

103. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:48. 

104. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:49. 

10 

105. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO50. 

106. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:51. 

15 107. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO.52. 

108. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:53. 

20 

109. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:54. 

1 10. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

25 NO:55. 

111. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:56. 

30 1 12. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:57. 

1 13. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
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NO:58. 

1 14. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:59. 

5 

1 15. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO.60. 

1 16. A polynucleotide of claim 12, comprising die polynucleotide sequence of SEQ ID 

10 NO:61. 

1 17. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:62. 

IS 1 18. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:63. 

1 19. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:64. 

20 

120. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:65. 

121. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

25 NO:66. 

122. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:67. 

30 123. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:68. 

124. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
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NO:69. 

125. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:70. 

126. A polynucleotide of claim 12, comprising die polynucleotide sequence of SEQ ID 

NO:71. 

127. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:72. 

128. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:73. 

129. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:74. 

130. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:75. 

131. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:76. 

132. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:77. 

133. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:78. 

134. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:79. 

135. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
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NO:80. 

136. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:81. 

5 

137. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:82. 

138. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

10 NO:83. 

139. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:84. 

15 140. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:85. 

141. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:86. 

20 

142. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:87. 

143. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

25 NO:88. 

144. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:89. 

30 145. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:90. 

146. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

213 



WO 03/068943 PCT/US03/04712 
NO:91. 

147. A polynucleotide of claim 12, comprising fee polynucleotide sequence of SEQ ID 

NO:92. 

148. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:93. 

149. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:94. 

150. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:95. 

151. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:96. 

152. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 

NO:97. 

153. A polynucleotide of claim 12, comprising fee polynucleotide sequence of SEQ ID 

NO:98. 

154. A polynucleotide of claim 12, comprising fee polynucleotide sequence of SEQ ID 

NO:99. 

155. A polynucleotide of claim 12, comprising fee polynucleotide sequence of SEQ ID 
NO:100. 

156. A polynucleotide of claim 12, comprising the polynucleotide sequence of SEQ ID 
NO:101. 

157. A polynucleotide of claim 12, comprising fee polynucleotide sequence of SEQ ID 
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NO:102. 
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<160> 102 

<170> PERL Program 

<210> 1 

<211> 120 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> misc_£eature 

<223> Incyte ID No: 7506904CD1 

<400> 1 



Met 


Gly 


Gly 


Tyr Leu 


Lys 


Thr 


Arg Pro Trp 


Thr Leu Gin His 


Phe 


1 






5 






10 




15 


Tyr 


Leu 


Cys 


Leu Met 


Pro 


Ala 


Ala Thr Trp 


Leu Val Leu Leu 


Leu 








20 






25 




30 


Leu 


Leu 


Trp 


Leu Ser 


Leu 


Gly 


Val Lys Thr 


Gly Ser Cys Ser 


Gin 








35 






40 




45 


Pro 


Gin 


Asn 


Leu Cys 


Cys 


Leu 


Gly Thr Asp 


His His Cys Lys 


Arg 








50 






55 




60 


Gly 


Ser 


Cys 


Tyr Cys 


Asp 


Glu 


Phe Cys His 


Val Ala Pro Asp 


Cys 








65 






70 




75 


His 


Pro 


Asp 


His Ser 


Val 


Leu 


Cys Asn Pro 


Gly Asn Ser His 


Thr 








80 






85 




90 


Gly 


Pro 


lie 


Pro Pro 


Thr 


Ala 


Lys Leu Asp 


Ala Met Ala Gly 


Arg 








95 






100 




105 


Gly 


Lys 


Pro 


Phe Ala 


Cys 


Thr 


Ser Gly Gin 


Ser Arg Asp Val 


Ala 








110 






115 




120 



<210> 2 

<211> 79 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7506909CD1 

<400> 2 

Met Gly Gly Tyr Leu Lys Thr Arg Pro Trp Thr Leu Gin His Phe 
15 10 15 

Tyr Leu Cys Leu Met Pro Ala Ala Thr Trp Leu Val Leu Leu Leu 

20 25 30 

Leu Leu Trp Leu Ser Leu Gly Val Lys Thr Ala Ser Gin Met Thr 

35 40 45 

Lys Met Val Leu Gin Met Val Leu Arg Met Glu Asn Pro Pro Ser 

50 55 60 

Pro Ala Arg Ser His Leu Asp Trp Met Gin Ser Met Val Ser Ser 

65 70 75 

Leu Gin Val Leu 



<210> 3 
<211> 270 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7507096CD1 

<400> 3 

Met Ala Thr Leu Ser Phe Val Phe Leu Leu Leu Gly Ala Val Ser 
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1 






5 


10 




15 


Trp 


Pro 


Pro Ala 


Ser Ala 


Ser Gly Gin Glu 


Phe Trp Pro Gly 


Gin 








20 


25 




30 


Ser 


Ala 


Ala Asp 


He Leu 


Ser Gly Ala Ala 


Ser Arg Arg Arg 


Tyr 








35 


40 




45 


Leu 


Leu 


Tyr Asp 


Val Asn 


Pro Pro Glu Gly 


Phe Asn Leu Arg 


Arg 








50 


55 




60 


Asp 


Val 


Tyr He 


Arg He 


Ala Ser Leu Leu 


Lys Thr Leu Leu 


Lys 








65 


70 




75 


Thr 


Glu 


Glu Trp 


Val Leu 


Val Leu Pro Pro 


Trp Gly Arg Leu 


Tyr 








80 


85 




90 


His 


Trp 


Gin Ser 


Pro Asp 


He His Gin Val 


Arg He Pro Trp 


Ser 








95 


100 




105 


Glu 


Phe 


Phe Asp 


Leu Pro 


Ser Leu Asn Lys 


Asn He Pro Val 


He 








110 


115 




120 


Glu 


Tyr 


Glu Gin 


Phe He 


Ala Gly Leu Ser 


Leu His Arg Gly 


Ala 








125 


130 




135 


Pro 


Ala 


Ala Glu 


Lys His 


He Ser Pro Val 


Arg Asp Val Arg 


Gin 








140 


145 




150 


Ser 


Arg 


Glu Pro 


Thr Ser 


Arg Pro Leu Trp 


Arg Glu Arg He 


Leu 








155 


160 




165 


Gly 


Tyr 


Pro Ser 


Gin His 


Gly Val Cys Gin 


Ala Pro Ala Gly 


Gly 








170 


175 




180 


Gly 


Arg 


Arg Val 


Gin Glu 


Gin Thr Ser Gin 


Leu His Gly Arg 


Arg 








185 


190 




195 


Arg 


Gin 


Asp Pro 


Leu Pro 


Gly Gly Leu Asp 


Glu Asp Glu Gly 


Gin 








200 


205 




210 


Ala 


Gly 


Leu Arg 


Ala Arg 


Gly Pro Leu Pro 


Gly Ser Pro Pro 


Glu 








215 


220 




225 


Lys 


Lys 


Arg Phe 


His Leu 


Gly Ser Gin Thr 


Gly Cys Thr Gin 


Ser 








230 


235 




240 


Gly 


Arg 


Gly Arg 


Glu Glu 


Asp Pro Gin Pro 


His Glu Asp Pro 


Pro 








245 


250 




255 


Ala 


Gly 


Gin Gly 


Val Cys 


Gly His Arg Cys 


Arg Gin Lys Gly 


He 








260 


265 




270 



<210> 4 

<211> 225 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7507098CD1 

<400> 4 

Met Arg Gly Glu Gly Gly Ala Gly Arg Cys Arg Gly Tyr Trp Glu 
15 10 15 

Arg He Ser Arg Gly Arg Gly Val Val Pro Gly Arg Gly Ala Ala 

20 25 30 

Ala Gly Ser Ala Trp Pro Pro Gly Ala Met Ala Thr Leu Ser Phe 

35 40 45 

Val Phe Leu Leu Leu Gly Ala Val Ser Trp Pro Pro Ala Ser Ala 

50 55 60 

Ser Gly Gin Glu Phe Trp Pro Gly Gin Ser Ala Ala Asp He Leu 
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65 



70 



75 



Ser Gly Ala Ala Ser Arg Arg Arg Tyr Leu Leu Tyr Asp Val Asn 
80 85 90 

Pro Pro Glu Gly Phe Asn Leu Arg Arg Asp Val Tyr lie Arg lie 
95 100 105 

Ala Ser Leu Leu Lys Thr Leu Leu Lys Thr Glu Glu Trp Val Leu 

110 115 120 

Val Leu Pro Pro Trp Gly Arg Leu Tyr His Trp Gin Ser Pro Asp 

125 130 135 

lie His Gin Val Arg lie Pro Trp Ser Glu Phe Phe Asp Leu Pro 

140 145 150 

Ser Leu Asn Lys Asn He Pro Val He Glu Tyr Glu Gin Phe He 

155 160 165 

Ala Glu Ser Gly Gly Ser Phe He Asp Gin Val Tyr Val Leu Gin 

170 175 180 

Ser Tyr Ala Glu Gly Trp Lys Glu Gly Thr Trp Glu Glu Lys Val 

185 190 195 

Asp Glu Arg Pro Cys He Asp Gin Leu Leu Tyr Ser Gin Asp Lys 

200 205 210 

His Glu Tyr Tyr Arg Ala Gin Pro Pro Ser Trp Arg Pro Cys Cys 



<210> 5 

<211> 283 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 7507099CD1 

<400> 5 

Met Arg Gly Glu Gly Gly Ala Gly Arg Cys Arg Gly Tyr Trp Glu 
15 10 15 

Arg He Ser Arg Gly Arg Gly Val Val Pro Gly Arg Gly Ala Ala 
20 25 30 

Ala Gly Ser Ala Trp Pro Pro Gly Ala Met Ala Thr Leu Ser Phe 
35 40 45 

Val Phe Leu Leu Leu Gly Ala Val Ser Trp Pro Pro Ala Ser Ala 
50 55 60 

Ser Gly Gin Glu Phe Trp Pro Gly Gin Ser Ala Ala Asp He Leu 
65 70 75 

Ser Gly Ala Ala Ser Arg Arg Arg Tyr Leu Leu Tyr Asp Val Asn 
80 85 90 

Pro Pro Glu Gly Phe Asn Leu Arg Arg Asp Val Tyr He Arg He 
95 100 105 

Ala Ser Leu Leu Lys Thr Leu Leu Lys Thr Glu Glu Trp Val Leu 
110 115 120 

Val Leu Pro Pro Trp Gly Arg Leu Tyr His Trp Gin Ser Pro Asp 
125 130 135 

He His Gin Val Arg He Pro Trp Ser Glu Phe Phe Asp Leu Pro 
140 145 150 

Ser Leu Asn Lys Asn He Pro Val He Glu Tyr Glu Gin Phe He 
155 160 165 

Ala Glu Ser Gly Gly Pro Phe He Asp Gin Val Tyr Val Leu Gin 



215 



220 



225 
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170 



175 



180 



Ser Tyr Ala Qlu Gly Trp Lys Glu Gly Thr Trp Glu Glu Lys Val 

185 190 195 

Asp Glu Arg Pro Cys lie Asp Gin Leu His Gly Arg Arg Arg Gin 

200 205 210 

Asp Pro Leu Pro Gly Gly Leu Asp Glu Asp Glu Gly Gin Ala Gly 

215 220 225 

Leu Arg Ala Arg Gly Pro Leu Pro Gly Ser Pro Pro Glu Lys Lys 

230 235 240 

Arg Phe His Leu Gly Ser Gin Thr Gly Cys Thr Gin Ser Gly Arg 

245 250 255 

Gly Arg Glu Glu Asp Pro Gin Pro His Glu Asp Pro Pro Ala Gly 

260 265 270 

Gin Gly Val Cys Gly His Arg Cys Arg Gin Lys Gly lie 



<210> 6 
<211> 294 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7501399CD1 

<400> 6 

Met Ser Arg Ser Ala Thr Leu Leu Leu Cys Leu Leu Gly Cys His 
15 10 15 

Val Trp Lys Ala Val Thr Lys Thr Leu Arg Glu Pro Gly Ala Gly 
20 25 30 

Ala Gin Glu Val Thr Leu Lys Val His lie Ser Asp Ala Ser Thr 
35 40 45 

His Gin Pro Val Ala Asp Ala Leu lie Glu lie Phe Thr Asn Gin 
50 55 60 

Ala Ser He Ala Ser Gly Thr Ser Gly Thr Asp Gly Val Ala Phe 
65 70 75 

He Lys Phe Gin Tyr Lys Leu Gly Ser Gin Leu He Val Thr Ala 
80 85 90 

Ser Lys His Ala Tyr Val Pro Asn Ser Ala Pro Trp Lys Pro He 
95 100 105 

Arg Leu Pro Val Phe Ser Ser Leu Ser Leu Gly Leu Leu Pro Glu 
110 115 120 

Arg Ser Ala Thr Leu Met Val Tyr Glu Asp Val Val Gin He Val 
125 130 135 

Ser Gly Phe Gin Gly Ala Arg Pro Gin Pro Arg Val His Phe Gin 
140 145 150 

Arg Arg Ala Leu Arg Leu Pro Glu Asn Thr Ser Tyr Ser Asp Leu 
155 160 165 

Thr Ala Phe Leu Thr Ala Ala Ser Ser Pro Ser Glu Val Asp Ser 
170 175 180 

Phe Pro Tyr Leu Arg Gly Leu Asp Gly Asn Gly Thr Gly Asn Ser 
185 190 195 

Thr Arg His Asp Leu Thr Pro Val Thr Ala Val Ser Val His Leu 
200 205 210 

Leu Ser Ser Asn Gly Thr Pro Val Leu Val Asp Gly Pro He Tyr 



275 



280 



215 



220 



225 
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Val Thr Val Pro Leu Ala Thr Gin Ser Ser Leu Arg His Asn Ala 

230 235 240 

Tyr Val Ala Ala Trp Arg Phe Asp Gin Lys Leu Gly Thr Trp Leu 

245 250 255 

Lys Ser Gly Leu Gly Leu Val His Gin Glu Gly Ser Gin Leu Thr 

260 265 270 

Trp Thr Tyr lie Ala Pro Gin Leu Gly Tyr Trp Val Ala Ala Met 

275 280 285 

Ser Pro Pro lie Pro Gly Gly Ser Ala 

290 



<210> 7 

<211> 40 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7504768CD1 

<400> 7 

Met Trp Val Pro Val Val Phe Leu Thr Leu Ser Val Thr Trp He 
15 10 15 

Gly Gin Ser Ala Ala Val Phe Trp Cys Thr Pro Ser Gly Ser Ser 

20 25 30 

Gin Leu Pro Thr Ala Ser Gly Thr Lys Ala 

35 40 



<210> 8 

<211> 187 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 7500757CD1 



<400> 8 

Met Gly Ala Pro Leu Ala Val Ala Leu Gly Ala Leu His Tyr Leu 
15 10 15 

Ala Leu Phe Leu Gin Leu Gly Gly Ala Thr Arg Pro Ala Gly His 

20 25 30 

Ala Pro Trp Asp Asn His Val Ser Gly His Ala Leu Phe Thr Glu 

35 40 45 

Thr Pro His Asp Met Thr Ala Arg Thr Gly Glu Asp Val Glu Met 

50 55 60 

Ala Cys Ser Phe Arg Gly Ser Gly Ser Pro Ser Tyr Ser Leu Glu 

65 70 75 

He Gin Trp Trp Tyr Val Arg Ser His Arg Asp Trp Thr Asp Lys 

80 85 90 

Gin Ala Trp Ala Ser Asn Gin Val Val Lys Val Val Gly Ser Asn 

95 100 105 

He Ser His Lys Leu Arg Leu Ser Arg Val Lys Pro Thr Asp Glu 
110 115 120 

Gly Thr Tyr Glu Cys Arg Val He Asp Phe Ser Asp Gly Lys Ala 
125 130 135 
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Arg His His Lys Val Lys Ala Tyr 
140 

Asn Ser Val Leu His Leu Pro Glu 
155 

Pro Pro Pro Pro Lys Pro Gly Lys 
170 

Asp Gin Glu Ala Cys Ser Leu 
185 

<210> 9 

<211> 261 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 1730616CD1 

<400> 9 

Met Lys lie Thr Asn Gly Arg His 

1 5 
Gly Thr Met Glu Asn Phe Thr Ala 
20 

Pro Pro Pro Pro Pro Thr Ala Leu 

35 

Pro Pro Pro Pro Pro Pro Pro Ala 
50 

Pro Pro Pro Thr Val Ala Thr Ala 
65 

Gly Ala Gly Cys Gly Pro Phe Tyr 
80 

Ser Thr Glu Leu Thr Gly Ser Thr 
95 

Leu Glu Gin Ala Tyr Asn Lys Phe 
110 

Lys Leu Ser Asn Phe Leu Pro Asp 
125 

Pro Gly Ser His Asp Asn Ser Ser 
140 

Pro Pro lie Leu Ser Ser Ser Phe 
155 

Leu Ala Gly Phe Arg Leu His Thr 
170 

Arg Leu Met His lie Gin Pro Pro 
185 

His Lys Gin Ser Arg Thr Gin Asp 
200 

Ser Asp Ser Asp His Lys Lys Lys 
215 

Pro Glu Arg Lys Arg Lys Lys Lys 
230 

Arg His Ser Pro Asp His Pro Gly 
245 

Ser Ser Ser Ser Leu Arg 
260 



Leu Arg Val Gin Pro Gly Glu 
145 150 

Ala Pro Pro Ala Ala Pro Ala 
160 165 

Glu Leu Arg Lys Arg Ser Val 
175 180 



Gly Asp 


Ser 


Ala Gly Ala Glu 


10 




15 


Leu Phe 


Gly 


Ala Gin Ala Asp 


25 




30 


Gly Phe 


Gly 


Pro Gly Lys Pro 


40 




45 


Gly Gly 


Gly 


Pro Gly Thr Ala 


55 




60 


Pro Pro 


Gly 


Ala Asp Lys Ser 


70 




75 


Leu Met 


Arg 


Glu Leu Pro Gly 


85 




90 


Asn Leu 


He 


Thr His Tyr Asn 


100 




105 


Cys Gly 


Lys 


Lys Val Lys Glu 


115 




120 


Leu Pro 


Gly 


Met He Asp Leu 


130 




135 


Leu Arg 


Ser 


Leu He Glu Lys 


145 




150 


Asn Pro 


He 


Thr Gly Thr Met 


160 




165 


Gly Pro 


Leu 


Pro Glu Gin Cys 


175 




180 


Lys Lys 


Lys 


Asn Lys His Lys 


190 




195 


Pro Val 


Pro 


Pro Glu Thr Pro 


205 




210 


Lys Lys 


Lys 


Lys Glu Glu Asp 


220 




225 


Glu Lys 


Lys 


Lys Lys Lys Asn 


235 




240 


Met Gly 


Ser 


Ser Gin Ala Ser 


250 




255 
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<210> 10 

<211> 506 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 190404CD1 

<400> 10 

Met He Asn Ser He Phe Glu Ala He Leu Gin He Leu Ser His 
15 10 15 

Pro Pro Ser Arg Arg Glu His Gly Tyr Asp Ala Val Val Leu Leu 
20 25 30 

Ala Leu Leu Val Asn Tyr Arg Lys Tyr Glu Ser Val Asn Pro Tyr 
35 40 45 

He Val Lys Leu Ser He Val Asp Asp Glu Ala Thr Leu Asn Gly 
50 55 60 

Met Gly Leu Val He Ala Gin Ala Leu Ser Glu Tyr Asn Arg Gin 
65 70 75 

Tyr Lys Asp Lys Glu Glu Glu His Gin Ser Gly Phe Phe Ser Ala 
80 85 90 

Leu Thr Asn Met Val Gly Ser Met Phe He Ala Asp Ala His Glu 
95 100 105 

Lys He Ser Val Gin Thr Asn Glu Ala He Leu Leu Ala Leu Tyr 

110 115 120 

Glu Ala Val His Leu Asn Arg Asn Phe He Thr Val Leu Ala Gin 

125 130 135 

Ser His Pro Glu Met Gly Leu Val Thr Thr Pro Val Ser Pro Ala 

140 145 150 

Pro Thr Thr Pro Val Thr Pro Leu Gly Thr Thr Pro Pro Ser Ser 

155 160 165 

Asp Val He Ser Ser Val Glu Leu Pro Leu Asp Ala Asp Val Gin 

170 175 180 

Thr Ser Asn Leu Leu He Thr Phe Leu Lys Tyr Ser Ser He Val 

185 190 195 

Met Gin Asp Thr Lys Asp Glu His Arg Leu His Ser Gly Lys Leu 

200 205 210 

Cys Leu He He Leu Thr Cys He Ala Glu Asp Gin Tyr Ala Asn 

215 220 225 

Ala Phe Leu His Asp Asp Asn Met Asn Phe Arg Val Asn Leu His 

230 235 240 

Arg Met Pro Met Arg His Arg Lys Lys Ala Ala Asp Lys Asn Leu 

245 250 255 

Pro Cys Arg Pro Leu Val Cys Ala Val Leu Asp Leu Met Val Glu 

260 265 270 

Phe He Val Thr His Met Met Lys Glu Phe Pro Met Asp Leu Tyr 

275 280 285 

He Arg Cys He Gin Val Val His Lys Leu Leu Cys Tyr Gin Lys 

290 295 300 

Lys Cys Arg Val Arg Leu His Tyr Thr Trp Arg Glu Leu Trp Ser 

305 310 315 

Ala Leu He Asn Leu Leu Lys Phe Leu Met Ser Asn Glu Thr Val 

320 325 330 

Leu Leu Ala Lys His Asn He Phe Thr Leu Ala Leu Met He Val 

335 340 345 
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Asn Leu Phe Asn Met Phe He Thr Tyr Gly Asp Thr Phe Leu Pro 

350 355 360 

Thr Pro Ser Ser Tyr Asp Glu Leu Tyr Tyr Glu He He Arg Met 

365 370 375 

His Gin Ser Phe Asp Asn Leu Tyr Ser Met Val Leu Arg Leu Ser 

380 385 390 

Thr Asn Ala Gly Gin Trp Lys Glu Ala Ala Ser Lys Val Thr His 

395 400 405 

Ala Leu Val Asn He Arg Ala He He Asn His Phe Asn Pro Lys 

410 415 420 

He Glu Ser Tyr Ala Ala Val Asn His He Ser Gin Leu Ser Glu 

425 430 435 

Glu Gin Val Leu Glu Val Val Arg Ala Asn Tyr Asp Thr Leu Thr 

440 445 450 

Leu Lys Leu Gin Asp Gly Leu Asp Gin Tyr Glu Arg Tyr Ser Glu 

455 460 465 

Gin His Lys Glu Ala Ala Phe Phe Lys Glu Leu Val Arg Ser He 

470 475 480 

Ser Thr Asn Val Arg Arg Asn Leu Ala Phe His Thr Leu Ser Gin 

485 490 495 

Glu Val Leu Leu Lys Glu Phe Ser Thr He Ser 

500 505 

<210> 11 

<211> 69 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500679CD1 

<400> 11 

Met Gly Glu Leu Ser Pro Leu Gin Arg Pro Leu Ala Thr Glu Gly 
15 10 15 

Thr Met Lys Ala Gin Gly Val Leu Leu Lys Leu Ala Leu Leu Ala 

20 25 30 

Leu Pro Leu Leu Leu Leu Leu Ser Thr Pro Pro Cys Ala Pro Gin 

35 40 45 

Val Ser Gly He Arg Gly Asp Gly Phe Pro Glu Glu He Gin Trp 

50 55 60 

Leu Ser Lys Phe Leu Pro Arg Leu Glu 

65 

<210> 12 

<211> 121 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500687CD1 

<400> 12 

Met Ala Leu Pro Pro Gly Pro Ala Ala Leu Arg His Thr Leu Leu 
15 10 15 
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Leu Leu Pro Ala Leu Leu Ser Ser Gly Pro Ser Arg His Pro Ser 

20 25 30 

Leu He Ser Ser Asp Ser Asn Asn Leu Lys Leu Asn Asn Val Arg 

35 40 45 

Leu Pro Arg Glu Asn Met Ser Leu Pro Ser Asn Leu Gin Leu Asn 

50 55 60 

Asp Leu Thr Pro Asp Ser Arg Ala Val Lys Pro Ala Asp Arg Gin 

65 70 75 

Net Ala Gin Asn Asn Ser Arg Pro Glu Leu Leu Asp Pro Glu Pro 

80 85 90 

Gly Gly Leu Leu Thr Ser Gin Gly Phe He Arg Leu Pro Val Leu 

95 100 105 

Gly Tyr He Tyr Arg Val Ser Ser Val Ser Ser Asp Glu He Trp 

110 115 120 

Leu 



<210> 13 

<211> 290 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_ feature 

<223> Incyte ID No: 7500688CD1 

<400> 13 

Met Ala Leu Pro Pro Gly Pro Ala Ala Leu Arg His Thr Leu Leu 

15 10 15 

Leu Leu Pro Ala Leu Leu Ser Ser Gly Gly Pro Gly Thr Pro Arg 

20 25 30 

Leu Ala Trp Tyr Leu Asp Gly Gin Leu Gin Glu Ala Ser Thr Ser 

35 40 45 

Arg Leu Leu Ser Val Gly Gly Glu Ala Phe Ser Gly Gly Thr Ser 

50 55 60 

Thr Phe Thr Val Thr Ala His Arg Ala Gin His Glu Leu Asn Cys 

65 70 75 

Ser Leu Gin Asp Pro Arg Ser Gly Arg Ser Ala Asn Ala Ser Val 

80 85 90 

He Leu Asn Val Gin Phe Lys Pro Glu He Ala Gin Val Gly Ala 

95 100 105 

Lys Tyr Gin Glu Ala Gin Gly Pro Gly Leu Leu Val Val Leu Phe 

110 115 120 

Ala Leu Val Arg Ala Asn Pro Pro Ala Asn Val Thr Trp He Asp 

125 130 135 

Gin Asp Gly Pro Val Thr Val Asn Thr Ser Asp Phe Leu Val Leu 

140 145 150 

Asp Ala Gin Asn Tyr Pro Trp Leu Thr Asn His Thr Val Gin Leu 

155 160 165 

Gin Leu Arg Ser Leu Ala His Asn Leu Ser Val Val Ala Thr Asn 

170 175 180 

Asp Val Gly Val Thr Ser Ala Ser Leu Pro Ala Pro Gly Pro Ser 

185 190 195 

Arg His Pro Ser Leu He Ser Ser Asp Ser Asn Asn Leu Lys Leu 

200 205 210 

Asn Asn Val Arg Leu Pro Arg Glu Asn Met Ser Leu Pro Ser Asn 
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215 220 225 

Leu Gin Leu Asn Asp Leu Thr Pro Asp Ser Arg Ala Val Lys Pro 

230 235 240 

Ala Asp Arg Gin Met Ala Gin Asn Asn Ser Arg Pro Glu Leu Leu 

245 250 255 

Asp Pro Glu Pro Gly Gly Leu Leu Thr Ser Gin Gly Phe lie Arg 

260 265 270 

Leu Pro Val Leu Gly Tyr lie Tyr Arg Val Ser Ser Val Ser Ser 

275 280 285 

Asp Glu lie Trp Leu 

290 

<210> 14 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc — feature 

<223> Incyte ID No: 7500697CD1 

<400> 14 

Met Ser Ala Leu Ser Leu Leu He Leu Gly Leu Leu Thr Ala Val 
15 10 15 

Pro Pro Ala Ser Cys Gin Gin Gly Ser Leu His Thr 
20 25 

<210> 15 

<211> 500 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 7500709CD1 

<400> 15 

Met Leu Asp His Lys Asp Leu Glu Ala Glu He His Pro Leu Lys 
15 10 15 

Asn Glu Glu Arg Lys Ser Gin Glu Asn Leu Gly Asn Pro Ser Lys 

20 25 30 

Asn Glu Asp Asn Val Lys Ser Ala Pro Pro Gin Ser Arg Leu Ser 

35 40 45 

Arg Cys Arg Ala Ala Ala Phe Phe Leu Ser Leu Phe Leu Cys I>eu 

50 55 60 

Phe Val Val Phe Val Val Ser Phe Val He Pro Cys Pro Asp Arg 

65 70 75 

Pro Ala Ser Gin Arg Met Trp Arg He Asp Tyr Ser Ala Ala Val 

80 85 90 

He Tyr Asp Phe Leu Ala Val Asp Asp He Asn Gly Asp Arg He 

95 100 105 

Gin Asp Val Leu Phe Leu Tyr Lys Asn Thr Asn Ser Ser Asn Asn 
110 115 120 

Phe Ser Arg Ser Cys Val Asp Glu Gly Phe Ser Ser Pro Cys Thr 
125 130 135 

Phe Ala Ala Ala Val Ser Gly Ala Asn Gly Ser Thr Leu Trp Glu 
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140 145 150 

Arg Pro Val Ala Gin Asp Val Ala Leu Val Glu Cys Ala Val Pro 

155 160 165 

Gin Pro Arg Gly Ser Glu Ala Pro Ser Ala Cys lie Leu Val Gly 

170 175 180 

Arg Pro Ser Ser Phe lie Ala Val Asn Leu Phe Thr Gly Glu Thr 

185 190 195 

Leu Trp Asn His Ser Ser Ser Phe Ser Gly Asn Ala Ser lie Leu 

200 205 210 

Ser Pro Leu Leu Gin Val Pro Asp Val Asp Gly Asp Gly Ala Pro 

215 220 225 

Asp Leu Leu Val Leu Thr Gin Glu Arg Glu Glu Val Ser Gly His 

230 235 240 

Leu Tyr Ser Gly Ser Thr Gly His Gin lie Gly Leu Arg Gly Ser 

245 250 255 

Leu Gly Val Asp Gly Glu Ser Gly Phe Leu Leu His Val Thr Arg 

260 265 270 

Thr Gly Ala His Tyr lie Leu Phe Pro Cys Ala Ser Ser Leu Cys 

275 280 285 

Gly Cys Ser Val Lys Gly Leu Tyr Glu Lys Val Thr Glu Ser Gly 

290 295 300 

Gly Pro Phe Lys Ser Asp Pro His Trp Glu Ser Met Leu Asn Ala 

305 310 315 

Thr Thr Arg Arg Met Leu Ser His Ser Ser Gly Ala Val Arg Tyr 

320 325 330 

Leu Met His Val Pro Gly Asn Ala Gly Ala Asp Val Leu Leu Val 

335 340 345 

Gly Ser Glu Ala Phe Val Leu Leu Asp Gly Gin Glu Leu Thr Pro 

350 355 360 

Arg Trp Thr Pro Lys Ala Ala His Val Leu Arg Lys Pro lie Phe 

365 370 375 

Gly Arg Tyr Lys Pro Asp Thr Leu Ala Val Ala Val Glu Asn Gly 

380 385 390 

Thr Gly Thr Asp Arg Gin Glu Thr Gly Glu Ala Arg His Ser Leu 

395 400 405 

Tyr Met Phe His Pro Thr Leu Pro Arg Val Leu Leu Glu Leu Ala 

410 415 420 

Asn Val Ser Thr His He Val Ala Phe Asp Ala Val Leu Phe Glu 

425 430 435 

Pro Ser Arg Hi 3 Ala Ala Tyr He Leu Leu Thr Gly Pro Ala Asp 

440 445 450 

Ser Glu Ala Pro Gly Leu Val Ser Val He Lys His Lys Val Arg 

455 460 465 

Asp Leu Val Pro Ser Ser Arg Val Val Arg Leu Gly Glu Gly Gly 

470 475 480 

Pro Asp Ser Asp Gin Ala He Arg Asp Arg Phe Ser Arg Leu Arg 

485 490 495 

Tyr Gin Ser Glu Ala 

500 

<210> 16 

<211> 543 

<212> PRT 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 7500711CD1 

<400> 16 



Met 


Leu 


Asp 


His Lys 


Asp 


Leu Glu Ala 


Glu 


He His Pro 


Leu 


Lys 


1 






5 






10 






15 


Asn 


Glu 


Glu 


Arg Lys 


Ser 


Gin Glu Asn 


Leu 


Gly Asn Pro 


Ser 


Lys 








20 






25 






30 


Asn 


Glu 


Asp 


Asn Val 


Lys 


Ser Ala Pro 


Pro 


Gin Ser Arg 


Leu 


Ser 








35 






40 






45 


Arc* 


Cys 


Arg 


Ala Ala 


Ala 


Phe Phe Leu 


Ser 


Leu Phe Leu 


Cys 


Leu 








50 






55 






60 


Phe 


Val 


Val 


Phe Val 


Val 


Ser Phe Val 


He 


Pro Cys Pro 


Asp 


Arg 








65 






70 






75 


Pro 


Ala 


Ser 


Gin Arg 


Met 


Trp Arg He 


Asp 


Tyr Ser Ala 


Ala 


Val 








80 






85 






90 


He 


Tyr 


Asp 


Phe Leu 


Ala 


Val Asp Asp 


He 


Asn Gly Asp 


Arg 


He 








95 






100 






105 


Gin 


Asp 


Val 


Leu Phe 


Leu 


Tyr Lys Asn 


Thr 


Asn Ser Ser 


Asn 


Asn 








110 






115 






120 


Phe 


Ser 


Arg 


Ser Cys 


Val 


Asp Glu Ala 


Ala 


Val Ser Gly 


Ala 


Asn 








125 






130 






135 


Gly 


Ser 


Thr 


Levi Trp 


Glu 


Arg Pro Val 


Ala 


Gin Asp Val 


Ala 


Leu 








140 






145 






150 


Val 


Glu 


Cys 


Ala Val 


Pro 


Gin Pro Arg 


Gly 


Ser Glu Ala 


Pro 


Ser 








155 






160 






165 


Ala 


Cys 


He 


Leu Val 


Gly 


Arg Pro Ser 


Ser 


Phe He Ala 


Val 


Asn 








170 






175 






180 


Leu 


Phe 


Thr 


Gly Glu 


Thr 


Leu Trp Asn 


His 


Ser Ser Ser 


Phe 


Ser 








185 






190 






195 


Gly 


Asn 


Ala 


Ser He 


Leu 


Ser Pro Leu 


Leu 


Gin Val Pro 


Asp 


Val 








200 






205 






210 


Asp 


Gly 


Asp 


Gly Ala 


Pro 


Asp Leu Leu 


Val 


Leu Thr Gin 


Glu 


Arg 








215 






220 






225 


Glu 


Glu 


Val 


Ser Gly 


His 


Leu Tyr Ser 


Gly 


Ser Thr Gly 


His 


Gin 








230 






235 






240 


He 


Gly 


Leu 


Arg Gly 


Ser 


Leu Gly Val 


Asp 


Gly Glu Ser 


Gly 


Phe 








245 






250 






255 


Leu 


Leu 


His 


Val Thr 


Arg 


Thr Gly Ala 


His 


Tyr He Leu 


Phe 


Pro 








260 






265 






270 


Cys 


Ala 


Ser 


Ser Leu 


Cys 


Gly Cys Ser 


Val 


Lys Gly Leu 


Tyr 


Glu 








275 






280 






285 


Lys 


Val 


Thr 


Gly Ser 


Gly 


Gly Pro Phe 


Lys 


Ser Asp Pro 


His 


Trp 








290 






295 






300 


Glu 


Ser 


Met 


Leu Asn 


Ala 


Thr Thr Arg 


Arg 


Met Leu Ser 


His 


Ser 








305 






310 






315 


Ser 


Gly 


Ala 


Val Arg 


Tyr 


Leu Met His 


Val 


Pro Gly Asn 


Ala 


Gly 








320 






325 






330 


Ala 


Asp 


Val 


Leu Leu 


Val 


Gly Ser Glu 


Ala 


Phe Val Leu 


Leu 


Asp 








335 






340 






345 


Gly 


Gin 


Glu 


Leu Thr 


Pro 


Arg Trp Thr 


Pro 


Lys Ala Ala 


His 


Val 








350 






355 






360 


Leu 


Arg 


Lys 


Pro He 


Phe 


Gly Arg Tyr 


Lys 


Pro Asp Thr 


Leu 


Ala 








365 






370 






375 


Val 


Ala 


Val 


Glu Asn 


Gly 


Thr Gly Thr 


Asp 


Arg Gin He 


Leu 


Phe 








380 






385 






390 
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Leu Asp 
Ser Leu 
His Arg 
Thr Ser 
Phe His 
Ser Thr 
Arg His 
Ala Pro 
Val Pro 
Ser Asp 
Ser Olu 



Leu Gly 
Pro Gly 
Ser Ala 
Glu Thr 
Pro Thr 
His He 
Ala Ala 
Gly Leu 
Ser Ser 
Gin Ala 
Ala 



Thr Gly 
395 

Gly Pro 
410 

Phe Phe 
425 

Glu Thr 
440 

Leu Pro 
455 

Val Ala 
470 

Tyr He 
485 

Val Ser 
500 

Arg Val 
515 

He Arg 
530 



Ala Val 
Leu Ser 
Phe Trp 
Gly Glu 
Arg Val 
Phe Asp 
Leu Leu 
Val He 
Val Arg 
Asp Arg 



Leu Cys Ser 

400 
Ala Ser Leu 

415 
Gly Leu His 

430 
Ala Arg His 

445 
Leu Leu Glu 

460 
Ala Val Leu 

475 
Thr Gly Pro 

490 
Lys His Lys 

505 
Leu Gly Glu 

520 
Phe Ser Arg 

535 



Leu Ala 
Pro Thr 
Glu Leu 
Ser Leu 
Leu Ala 
Phe Glu 
Ala Asp 
Val Arg 
Gly Gly 
Leu Arg 



Leu Pro 

405 
Ala Asp 

420 
Gly Ser 

435 
Tyr Met 

450 
Asn Val 

465 
Pro Ser 

480 
Ser Glu 

495 
Asp Leu 

510 
Pro Asp 

525 
Tyr Gin 

540 



<210> 17 
<211> 72 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500723CD1 



<400> 17 

Met Gly Val Pro Thr Ala Pro Glu Ala Gly Ser Trp Arg Trp Gly 

15 10 15 

Ser Leu Leu Phe Ala Leu Phe Leu Ala Ala Ser Leu Gly Pro Ser 

20 25 30 

Pro Ala Pro Trp Arg Pro Pro Gly Cys Gin His Gin Pro Arg Pro 

35 40 45 

Gly Ser Ala Pro Arg Ala Gly Val Gly Leu Arg Pro Pro Trp Gin 

50 55 60 

Leu Leu His His His Ala Gin Pro Asp Pro Ala Gly 

65 70 



<210> 18 
<211> 22 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500764CD1 



<400> 18 

Met Ser Trp Pro Arg Arg Leu Leu Leu Arg Tyr Leu Phe Pro Ala 
15 10 15 
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Leu Leu Leu His Ala Val Lys 
20 

<210> 19 

<211> 26 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 7500772CD1 

<400> 19 

Met Phe Leu Lys Ala Val Val Leu Thr Leu Ala Leu Val Ala Val 
15 10 15 

Ala Val Pro Ser Ser Arg Thr Asn Leu Glu Lys 
20 25 

<210> 20 

<211> 27 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_£ eature 

<223> Incyte ID No: 7501350CD1 

<400> 20 

Met lie Pro Ala Val Val Leu Leu Leu Leu Leu Leu Val Glu Gin 
15 10 15 

Ala Asp Pro Ser Ala Lys Gly Ser Tyr Asn Gin Leu 
20 25 

<210> 21 

<211> 253 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 7506396CD1 

<400> 21 

Met Pro Gin Met Arg Gin Thr Pro Thr Asp Lys Pro Leu Cys Pro 
15 10 15 

Ser Arg Thr His Lys Val Leu Pro lie Leu Glu lie Leu Tyr His 

20 25 30 

Val Glu Glu Arg Asn Ser His His Val Tyr Met Ala Leu He He 

35 40 45 

Leu Leu He Leu Thr Glu Asp Asp Gly Phe Asn Arg Ser He His 

50 55 60 

Glu Val He Leu Lys Asn He Thr Trp Tyr Ser Glu Arg Val Leu 

65 70 75 

Thr Glu He Ser Leu Gly Ser Leu Leu He Leu Val Val He Arg 

80 85 90 

Thr He Gin Tyr Asn Met Thr Arg Thr Arg Asp Lys Tyr Leu His 
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95 



100 



105 



Thr Asn Cys Leu Ala Ala Leu Ala Asn Met Ser Ala Gin Phe Arg 

110 115 120 

Ser Leu His Gin Tyr Ala Ala Gin Arg He He Ser Leu Phe Ser 

125 130 135 

Leu Leu Ser Lys Lys His Asn Lys Val Leu Glu Gin Ala Thr Gin 

140 145 150 

Ser Leu Arg Gly Ser Leu Ser Ser Asn Asp Val Pro Leu Pro Asp 

155 160 165 

Tyr Val He Ser Phe Phe Ser Ser Arg Leu Leu Gin Ala Gly Ala 

170 175 180 

Glu Leu Ser Val Glu Arg Val Leu Glu He He Lys Gin Gly Val 

185 190 195 

Val Ala Leu Pro Lys Asp Arg Leu Lys Lys Phe Pro Glu Leu Lys 

200 205 210 

Phe Lys Tyr Val Glu Glu Glu Gin Pro Glu Glu Phe Phe He Pro 

215 220 225 

Tyr Val Trp Ser Leu Val Tyr Asn Ser Ala Val Gly Leu Tyr Trp 

230 235 240 

Asn Pro Gin Asp He Gin Leu Phe Thr Met Asp Ser Asp 



<210> 22 

<211> 511 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7505917CD1 

<400> 22 

Met Ala Ala Glu Gin Asp Pro Glu Ala Arg Ala Ala Ala Arg Pro 
15 10 15 

Leu Leu Thr Asp Leu Tyr Gin Ala Thr Met Ala Leu Gly Tyr Trp 
20 25 30 

Arg Ala Gly Arg Ala Arg Asp Ala Ala Glu Phe Glu Leu Phe Phe 
35 40 45 

Arg Arg Cys Pro Phe Gly Gly Ala Phe Ala Leu Ala Ala Gly Leu 
50 55 60 

Arg Asp Cys Val Arg Phe Leu Arg Ala Phe Arg Leu Arg Asp Ala 
65 70 75 

Asp Val Gin Phe Leu Ala Ser Val Leu Pro Pro Asp Thr Asp Pro 
80 85 90 

Ala Phe Phe Glu His Leu Arg Ala Leu Asp Cys Ser Glu Val Thr 
95 100 105 

Val Arg Ala Leu Pro Glu Gly Ser Leu Ala Phe Pro Gly Val Pro 
110 115 120 

Leu Leu Gin Val Ser Gly Pro Leu Leu Val Val Gin Leu Leu Glu 
125 130 135 

Thr Pro Leu Leu Cys Leu Val Ser Tyr Ala Ser Leu Val Ala Thr 
140 145 150 

Asn Ala Ala Arg Leu Arg Leu He Ala Gly Pro Glu Lys Arg Leu 
155 160 165 

Leu Glu Met Gly Leu Arg Arg Ala Gin Gly Pro Asp Gly Gly Leu 



245 



250 



170 



175 



180 
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Thr Ala 


Ser Thr 


Tyr 


Ser 


Tyr 


Leu Gly Gly 


Phe Asp Ser 


Ser Ser 






185 










190 




195 


Asn Val 


Leu Ala 


Gly 


Gin 


Leu 


Arg Gly Val 


Pro Val Ala 


Gly Thr 






200 










205 




210 


Leu Ala 


His Ser 


Phe 


Val 


Thr 


Ser 


Phe 


Ser 


Gly Ser Glu 


Val Pro 






215 










220 




225 


Pro Asp 


Pro Met 


Leu 


Ala 


Pro 


Ala Ala Gly 


Glu Gly Pro 


Gly Val 






230 










235 




240 


Asp Leu 


Ala Ala 


Lys 


Ala 


Gin 


Val 


Trp Leu 


Glu Gin Val 


Cys Ala 






245 










250 




255 


His Leu 


Gly Leu 


Gly 


Val 


Gin 


Glu 


Pro His 


Pro Gly Glu 


Arg Ala 






260 










265 




270 


Ala Phe 


Val Ala 


Tyr 


Ala 


Leu 


Ala 


Phe 


Pro 


Arg Ala Phe 


Gin Gly 






275 










280 




285 


Leu Leu 


Asp Thr 


Tyr 


Ser 


Val 


Trp Arg Ser 


Gly Leu Pro 


Asn Phe 






290 










295 




300 


Leu Ala 


Val Ala 


Leu 


Ala 


Leu 


Gly Glu Leu 


Gly Tyr Arg 


Ala Val 






305 










310 




315 


Gly Val 


Arg Leu 


Asp 


Ser 


Gly 


Asp Leu 


Leu 


Gin Gin Ala 


Gin Glu 






320 










325 




330 


lie Arg 


Lys Val 


Phe 


Arg 


Ala 


Ala 


Ala 


Ala 


Gin Phe Gin 


Val Pro 






335 










340 




345 


Trp Leu 


Glu Ser 


Val 


Leu 


lie 


Val 


Val 


Ser 


Asn Asn lie 


Asp Glu 






350 










355 




360 


Glu Ala 


Leu Ala 


Arg 


Leu 


Ala 


Gin 


Glu 


Leu 


Val Ala Val 


Gly Gly 






365 










370 




375 


Gin Pro 


Arg Met 


Lys 


Leu 


Thr 


Glu Asp Pro 


Glu Lys Gin 


Thr Leu 






380 










385 




390 


Pro Gly 


Ser Lys 


Ala 


Ala 


Phe 


Arg 


Leu 


Leu 


Gly Ser Asp 


Gly Ser 






395 










400 




405 


Pro Leu 


Met Asp 


Met 


Leu 


Gin 


Leu 


Ala 


Glu 


Glu Pro Val 


Pro Gin 






410 










415 




420 


Ala Gly 


Gin Glu 


Leu 


Arg 


Val 


Trp 


Pro 


Pro 


Gly Ala Gin 


Glu Pro 






425 










430 




435 


Cys Thr 


Val Arg 


Pro 


Ala 


Gin 


Val 


Glu 


Pro 


Leu Leu Arg 


Leu Cys 






440 










445 




450 


Leu Gin 


Gin Gly 


Gin 


Leu 


Cys 


Glu 


Pro 


Leu 


Pro Ser Leu 


Ala Glu 






455 










460 




465 


Ser Arg 


Ala Leu 


Ala 


Gin 


Leu 


Ser 


Leu 


Ser 


Arg Leu Ser 


Pro Glu 






470 










475 




480 


His Arg 


Arg Leu 


Arg 


Ser 


Pro 


Ala Gin Tyr 


Gin Val Val 


Leu Ser 






485 










490 




495 


Glu Arg 


Leu Gin 


Ala 


Leu 


Val 


Asn 


Ser 


Leu 


Cys Ala Gly 


Gin Ser 






500 










505 




510 



Pro 



<210> 23 

<211> 127 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7500701CD1 
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<400> 23 



Met 


Gly 


His 


Arg Phe 


Leu 


Arg 


Gly Leu Leu 


Thr Leu Leu Leu Pro 


1 






5 






10 


15 


Pro 


Pro 


Pro 


Leu Tyr 


Thr 


Arg 


His Arg Met 


Leu Gly Pro Glu Ser 








20 






25 


30 


Val 


Pro 


Pro 


Pro Lys 


Arg 


Ser 


Arg Ser Lys 


Leu Met Ala Pro Pro 








35 






40 


45 


Arg 


lie 


Gly 


Thr His 


Asn 


Gly 


Thr Phe His 


Cys Asp Glu Ala Leu 








50 






55 


€0 


Ala 


Cys 


Ala 


Leu Leu 


Arg 


Leu 


Leu Pro Glu 


Tyr Arg Asp Ala Glu 








65 






70 


75 


lie 


Val 


Arg 


Thr Arg 


Asp 


Pro 


Glu Lys Leu 


Ala Ser Cys Asp lie 








80 






85 


90 


Val 


Val 


Asp 


Val Gly 


Gly 


Glu 


Tyr Asp Pro 


Arg Arg His Arg Tyr 








95 






100 


105 


Asp 


His 


His 


Gin Arg 


Gin 


Gly 


Ser Ser Val 


Gin Trp lie Trp Phe 








110 






115 


120 


Lys 


Arg 


Ser 


Phe Cys 


Arg 


Asp 












125 











<210> 24 

<211> 137 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7500702CD1 

<400> 24 



Met 


Gly 


His Arg 


Phe 


Leu Arg Gly Leu Leu 


Thr Leu Leu Leu Pro 


1 






5 


10 


15 


Pro 


Pro 


Pro Leu 


Tyr 


Thr Arg His Arg Met 


Leu Gly Pro Glu Ser 








20 


25 


30 


Val 


Pro 


Pro Pro 


Lys 


Arg Ser Arg Ser Lys 


Leu Met Ala Pro Pro 








35 


40 


45 


Arg 


lie 


Gly Thr 


His 


Asn Gly Thr Phe His 


Cys Asp Glu Ala Leu 








50 


55 


60 


Ala 


Cys 


Ala Leu 


Leu 


Arg Leu Leu Pro Glu 


Tyr Arg Asp Ala Glu 








65 


70 


75 


lie 


Val 


Arg Thr 


Arg 


Asp Pro Glu Lys Leu 


Ala Ser Cys Asp lie 








80 


85 


90 


Val 


Val 


Asp Val 


Gly 


Gly Glu Tyr Asp Pro 


Arg Arg His Arg Tyr 








95 


100 


105 


Asp 


His 


His Gin 


Arg 


Cys Met Arg Thr Leu 


Trp Arg Arg Trp Met 








110 


115 


120 


Leu 


Trp 


Thr Met 


Gly 


Ser Pro Ser Gly Gin 


Arg Gly Ser Leu Asp 








125 


130 


135 


Met 


His 











<210> 25 

<211> 207 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_£eature 

<223> Incyte ID No: 6044343CD1 



<400> 25 

Met Pro Gly Val Leu His Leu His Val Arg Ala Leu Pro Arg His 
15 10 15 

Arg Glu Pro Ala Gly Leu Leu Ala Gin Arg Gin lie Val Pro Arg 
20 25 30 

Gly Gin Ser Leu Val Val Cys Val Ser Pro Phe Ser Pro Leu Trp 
35 40 45 

Gly Ser Glu Pro Pro Ala Leu Leu Leu Trp Val His Gin Ser Pro 
50 55 60 

Pro Arg Cys Pro His Ala Asp Ala His Arg Pro Ala Arg Gly Asn 
65 70 75 

Thr Asp Met Arg Thr Arg Arg Pro Thr His Arg Arg Thr Gin Ser 
80 85 90 

Cys Thr Trp Arg His Val Arg Thr Arg Arg Pro Thr Arg Arg Pro 
95 100 105 

Thr His Arg Cys Thr Arg Thr Cys Thr Trp Arg His Arg Cys Val 
110 115 120 

His Met Gin Ala His Thr Gin Val Ser Val Asp Leu Cys Glu Gin 
125 130 135 

Met Arg Ala Asp Leu His Val Glu Thr Gin Thr Cys Arg Arg Thr 
140 145 150 

Lys Arg Ser Leu Lys Ala Asp Thr Gin Glu Gly Ala Ser His Leu 
155 160 165 

Trp Asp Ser Arg Asp Ala Val Ala Asn Thr Ser Val Val Pro Ser 
170 175 180 

Ser Leu Leu Asp Lys Thr Glu Ala Cys Ser Gly His Thr Trp Glu 
185 190 195 

Ala Glu Pro Arg Asp Trp Ala Gin Glu Pro Phe Cys 
200 205 



<210> 26 

<211> 1008 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 750399OCD1 



<400> 26 

Met Ala Ala Met Ala Pro Ala Leu Thr Asp Ala Ala Ala Glu Ala 

15 10 15 

His His lie Arg Phe Lys Leu Ala Pro Pro Ser Ser Thr Leu Ser 

20 25 30 

Pro Gly Ser Ala Glu Asn Asn Gly Asn Ala Asn lie Leu lie Ala 

35 40 45 

Ala Asn Gly Thr Lys Arg Lys Ala lie Ala Ala Glu Asp Pro Ser 

50 55 60 

Leu Asp Phe Arg Asn Asn Pro Thr Lys Glu Asp Leu Gly Lys Leu 

65 70 75 

Gin Pro Leu Val Ala Ser Tyr Leu Cys Ser Asp Val Thr Ser Val 

80 85 90 
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Pro Ser Lys Glu Ser Leu Lys Leu Gin Gly Val Phe Ser Lys Gin 
95 100 105 

Thr Val Leu Lys Ser His Pro Leu Leu Ser Gin Ser Tyr Glu Leu 

110 115 120 

Arg Ala Glu Leu Leu Gly Arg Gin Pro Val Leu Glu Phe Ser Leu 

125 130 135 

Glu Asn Leu Arg Thr Met Asn Thr Ser Gly Gin Thr Ala Leu Pro 

140 145 150 

Gin Ala Pro Val Asn Gly Leu Ala Lys Lys Leu Thr Lys Ser Ser 

155 160 165 

Thr His Ser Asp His Asp Asn Ser Thr Ser Leu Asn Gly Gly Lys 

170 175 180 

Arg Ala Leu Thr Ser Ser Ala Leu His Gly Gly Glu Met Gly Gly 

185 190 195 

Ser Glu Ser Gly Asp Leu Lys Gly Gly Met Thr Asn Cys Thr Leu 

200 205 210 

Pro His Arg Ser Leu Asp Val Glu His Thr lie Leu Tyr Ser Asn 

215 220 225 

Asn Ser Thr Ala Asn Lys Ser Ser Val Asn Ser Met Glu Gin Pro 

230 235 240 

Ala Leu Gin Gly Ser Ser Arg Leu Ser Pro Gly Thr Asp Ser Ser 

245 250 255 

Ser Asn Leu Gly Gly Val Lys Leu Glu Gly Lys Lys Ser Pro Leu 

260 265 270 

Ser Ser lie Leu Phe Ser Ala Leu Asp Ser Asp Thr Arg lie Thr 

275 280 285 

Ala Leu Leu Arg Arg Gin Ala Asp lie Glu Ser Arg Ala Arg Arg 

290 295 300 

Leu Gin Lys Arg Leu Gin Val Val Gin Ala Lys Gin Val Glu Arg 

305 310 315 

His lie Gin His Gin Leu Gly Gly Phe Leu Glu Lys Thr Leu Ser 

320 325 330 

Lys Leu Pro Asn Leu Glu Ser Leu Arg Pro Arg Ser Gin Leu Met 

335 340 345 

Leu Thr Arg Lys Ala Glu Ala Ala Leu Arg Lys Ala Ala Ser Glu 

350 355 360 

Thr Thr Thr Ser Glu Gly Leu Ser Asn Phe Leu Lys Ser Asn Ser 

365 370 375 

He Ser Glu Glu Leu Glu Arg Phe Thr Ala Ser Gly lie Ala Asn 

380 385 390 

Leu Arg Cys Ser Glu Gin Ala Phe Asp Ser Asp Val Thr Asp Ser 

395 400 405 

Ser Ser Gly Gly Glu Ser Asp lie Glu Glu Glu Glu Leu Thr Arg 

410 415 420 

Ala Asp Pro Glu Gin Arg His Val Pro Leu Arg Arg Arg Ser Glu 

425 430 435 

Trp Lys Trp Ala Ala Asp Arg Ala Ala lie Val Ser Arg Trp Asn 

440 445 450 

Trp Leu Gin Ala His Val Ser Asp Leu Glu Tyr Arg He Arg Gin 

455 460 465 

Gin Thr Asp He Tyr Lys Gin He Arg Ala Asn Lys He Glu Ser 

470 475 480 

Val Ser Gin Pro Leu Glu Asn His Gly Ala Pro He He Gly His 

485 490 495 

He Ser Glu Ser Leu Ser Thr Lys Ser Cys Gly Ala Leu Arg Pro 

500 505 510 
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Val Asn Gly Val He Asn Thr Leu Gin Pro Val Leu Ala Asp His 

515 520 525 

He Pro Gly Asp Ser Ser Asp Ala Glu Glu Gin Leu His Lys Lys 

530 535 540 

Gin Arg Leu Asn Leu Val Ser Ser Ser Ser Asp Gly Thr Cys Val 

545 550 555 

Ala Ala Arg Thr Arg Pro Val Leu Ser Cys Lys Lys Arg Arg Leu 

560 565 570 

Val Arg Pro Asn Ser He Val Pro Leu Ser Lys Lys Val His Arg 

575 580 585 

Asn Ser Thr He Arg Pro Gly Cys Asp Val Asn Pro Ser Cys Ala 

590 595 600 

Leu Cys Oly Ser Gly Ser He Asn Thr Met Pro Pro Glu He His 

605 610 615 

Tyr Glu Ala Pro Leu Leu Glu Arg Leu Ser Gin Leu Asp Ser Cys 

620 625 630 

Val His Pro Val Leu Ala Phe Pro Asp Asp Val Pro Thr Ser Leu 

635 640 645 

His Phe Gin Ser Met Leu Lys Ser Gin Trp Gin Asn Lys Pro Phe 

650 655 660 

Asp Lys He Lys Pro Pro Lys Lys Leu Ser Leu Lys His Arg Ala 

665 670 675 

Pro Met Pro Gly Ser Leu Pro Asp Ser Ala Arg Lys Asp Arg His 

680 685 690 

Lys Leu Val Ser Ser Phe Leu Thr Thr Ala Met Leu Lys His His 

695 700 705 

Thr Asp Met Ser Ser Ser Ser Tyr Leu Ala Ala Thr His His Pro 

710 715 720 

Pro His Ser Pro Leu Val Arg Gin Leu Ser Thr Ser Ser Asp Ser 

725 730 735 

Pro Ala Pro Ala Ser Ser Ser Ser Gin Val Thr Ala Ser Thr Ser 

740 745 750 

Gin Gin Pro Val Arg Arg Arg Arg Gly Glu Ser Ser Phe Asp He 

755 760 765 

Asn Asn He Val He Pro Met Ser Val Ala Ala Thr Thr Arg Val 

770 775 780 

Glu Lys Leu Gin Tyr Lys Glu He Leu Thr Pro Ser Trp Arg Glu 

785 790 795 

Val Asp Leu Gin Ser Leu Lys Gly Ser Pro Asp Glu Glu Asn Glu 

800 805 810 

Glu He Glu Asp Leu Ser Asp Ala Ala Phe Ala Ala Leu His Ala 

815 820 825 

Lys Cys Glu Glu Met Glu Arg Ala Arg Trp Leu Trp Thr Thr Ser 

830 835 840 

Val Pro Pro Gin Arg Arg Gly Ser Arg Ser Tyr Arg Ser Ser Asp 

845 850 855 

Gly Arg Thr Thr Pro Gin Leu Gly Ser Ala Asn Pro Ser Thr Pro 

860 865 870 

Gin Pro Ala Ser Pro Asp Val Ser Ser Ser His Ser Leu Ser Glu 

875 880 885 

Tyr Ser His Gly Gin Ser Pro Arg Ser Pro He Ser Pro Glu Leu 

890 895 900 

His Ser Ala Pro Leu Thr Pro Val Ala Arg Asp Thr Leu Arg His 

905 910 915 

Leu Ala Ser Glu Asp Thr Arg Cys Ser Thr Pro Glu Leu Gly Leu 

920 925 930 
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Asp Glu 


Gin 


Ser 


Val 


Gin Pro Trp Glu Arg 


Arg Thr Phe Pro Leu 








935 


940 


945 


Ala His 


Ser 


Pro 


Gin 


Ala Glu Cys Glu Asp 


Gin Leu Asp Ala Gin 








950 


955 


960 


Glu Arg 


Ala 


Ala 


Arg 


Cys Thr Arg Arg Thr 


Ser Gly Ser Lys Thr 








965 


970 


975 


Gly Arg 


Glu 


Thr 


Glu 


Ala Ala Pro Thr Ser 


Pro Pro lie Val Pro 








980 


985 


990 


Leu Lys 


Ser 


Arg 


His 


Leu Val Ala Ala Ala 


Thr Ala Gin Arg Pro 








995 


1000 


1005 



Thr His Arg 



<210> 27 

<211> 70 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 7504655CD1 

<400> 27 

Met Val Gly Pro Ala Pro Arg Arg Arg Leu Arg Pro Leu Ala Ala 
15 10 15 

Leu Ala Leu Val Leu Ala Leu Ala Pro Gly Leu Pro Thr Ala Arg 

20 25 30 

Ala Gly Gin Thr Pro Arg Pro Ala Glu Arg Gly Pro Pro Val Arg 

35 40 45 

Leu Phe Thr Glu Glu Glu Leu Ala Arg Tyr Gly Gly Glu Glu Ser 

50 55 60 

Phe Met Asp Glu Glu Pro Pro Thr Met Pro 

65 70 

<210> 28 
<211> 142 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_ feature 

<223> Incyte ID No: 7504690CD1 

<400> 28 

Met Leu Leu Trp Pro Leu Leu Leu Leu Leu Leu Leu Leu Pro Thr 
15 10 15 

Leu Ala Leu Leu Arg Gin Gin Arg Ser Gin Asp Ala Arg Leu Ser 

20 25 30 

Trp Leu Ala Gly Leu Gin His Arg Val Ala Trp Gly Ala Leu Val 

35 40 45 

Trp Ala Ala Thr Trp Gin Arg Arg Arg Leu Glu Gin Ser Thr Leu 

50 55 60 

His Val His Gin Ser Gin Gin Gin Ala Leu Arg Trp Cys Leu Gin 

65 70 75 

Gly Ala Gin Arg Pro His Cys Ser Leu Arg Arg Ser Thr Asp lie 

80 85 90 
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Ser Thr Phe Arg Asn His Leu Pro Leu Thr Lys Ala Ser Gin Thr 
95 100 105 

Gin Gin Glu Asp Ser Gly Arg Gly Pro Ser Cys Trp Thr Met Ala 

110 115 120 

Val Trp Arg Ala Ala Phe Trp lie Pro Leu Arg Ala Leu Leu Pro 

125 130 135 

Thr Thr Arg Cys Leu Trp Arg 

140 

<210> 29 

<211> 43 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504720CD1 

<400> 29 

Met Ala Ser Arg Leu Thr Leu Leu Thr Leu Leu Leu Leu Leu Leu 
15 10 15 

Ala Gly Gly Leu Gly Arg Thr Pro Lys Gin Thr Trp Arg Ala Ser 

20 25 30 

Ser Leu Thr Pro Arg Thr Ser Pro Val Ser Thr Arg Pro 

35 40 

<210> 30 

<211> 64 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504722CD1 

<400> 30 

Met Ala Ser Gly Ser Asn Trp Leu Ser Gly Val Asn Val Val Leu 
15 10 15 

Val Met Ala Tyr Gly Ser Leu Val Phe Val Leu Leu Phe He Phe 

20 25 30 

Val Lys Arg Gin He Met Arg Phe Ala Met Lys Ser Arg Arg Gly 

35 40 45 

Pro His Val Pro Val Gly His Asn Ala Pro Lys Val Ala Thr Thr 

50 55 60 

He Cys He Gly 



<210> 31 

<211> 23 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7504733CD1 
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<400> 31 

Met Lys Leu Leu Ala Ala Thr Val Leu Leu Leu Thr lie Cys Ser 

15 10 15 
Leu Glu Val Leu Pro Asp Arg Asp 
20 

<210> 32 

<211> 349 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7507100CD1 

<400> 32 

Met Arg Gly Glu Gly Gly Ala Gly Arg Cys Arg Gly Tyr Trp Glu 

15 10 15 

Arg He Ser Arg Gly Arg Gly Val Val Pro Gly Arg Gly Ala Ala 

20 25 30 

Ala Gly Ser Ala Trp Pro Pro Gly Ala Met Ala Thr Leu Ser Phe 

35 40 45 

Val Phe Leu Leu Leu Gly Ala Val Ser Trp Pro Pro Ala Ser Ala 

50 55 60 

Ser Gly Gin Glu Phe Trp Pro Gly Gin Ser Ala Ala Asp He Leu 

65 70 75 

Ser Gly Ala Ala Ser Arg Arg Arg Tyr Leu Leu Tyr Asp Val Asn 

80 85 90 

Pro Pro Glu Gly Phe Asn Leu Arg Arg Asp Val Tyr He Arg He 

95 100 105 

Ala Ser Leu Leu Lys Thr Leu Leu Lys Thr Glu Glu Trp Val Leu 

110 115 120 

Val Leu Pro Pro Trp Gly Arg Leu Tyr His Trp Gin Ser Pro Asp 

125 130 135 

He His Gin Val Arg He Pro Trp Ser Glu Phe Phe Asp Leu Pro 

140 145 150 
Ser Leu Asn Lys Asn He Pro Val He Glu Tyr Glu Gin Phe He 

155 160 165 

Ala Glu Ser Gly Gly Pro Phe He Asp Gin Val Tyr Val Leu Gin 

170 175 180 

Ser Tyr Ala Glu Gly Trp Lys Glu Gly Thr Trp Glu Glu Lys Val 

185 190 195 
Asp Glu Arg Pro Cys He Asp Gin Leu Leu Tyr Ser Gin Asp Lys 

200 205 210 

His Glu Tyr Tyr Arg Cys Leu Leu Arg Leu Leu Pro Leu Pro Gin 

215 220 225 
Gly Ser Ala Ser He Val Ala Pro Leu Leu Leu Arg Asn Thr Ser 

230 235 240 
Ala Arg Ser Val Met Leu Asp Arg Ala Glu Asn Leu Leu His Asp 

245 250 255 
His Tyr Gly Gly Lys Glu Tyr Trp Asp Thr Arg Arg Ser Met Val 

260 265 270 
Phe Ala Arg His Leu Arg Glu Val Gly Asp Glu Phe Arg Ser Arg 

275 280 285 
His Leu Asn Ser Thr Asp Asp Ala Asp Arg He Pro Phe Gin Glu 

290 295 300 
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Asp Trp Met Lys Met Lys Val Lys Leu Gly Ser Ala Leu Oly Gly 
305 310 315 

Pro Tyr Leu Gly Val His Leu Arg Arg Lys Asp Phe lie Trp Gly 
320 325 330 

His Arg Gin Asp Val Pro Ser Leu Glu Gly Ala Gly Gly Thr Gly 
335 340 345 

Arg Pro Leu Ala 



<210> 33 

<211> 65 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7503330CD1 



<400> 33 

Met Phe Val Ala Ser Glu Arg Lys Met Arg Ala His Gin Val Leu 
15 10 15 

Thr Phe Leu Leu Leu Phe Val He Thr Ser Val Ala Ser Glu His 

20 25 30 

Ala Gin Leu Leu Arg His Val Ala Ala Gin Asp Ala Gly Asp Gly 

35 40 45 

Leu Arg Gly Gly Arg Ala Ala Ala Ala Gly Leu Tyr Gly Glu Gin 

50 55 60 

Gly Leu Leu His Gly 

65 



<210> 34 

<211> 64 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504519CD1 



<400> 34 

Met Asp His Cys Gly Ala Leu Phe Leu Cys Leu Cys Leu Leu Thr 
15 10 15 

Leu Gin Asn Ala Thr Thr Glu Thr Trp Glu Glu Leu Leu Ser Tyr 

20 25 30 

Met Glu Asn Met Gin Val Ser Arg Gly Arg Ser Ser Val Phe Ser 

35 40 45 

Ser Arg Gin Glu Val Ser Met Pro Gly Val Ser Thr Pro Cys Ser 

50 55 60 

Ser Pro Pro Ser 



<210> 35 

<211> 94 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 7504705CD1 

<400> 35 

Met Lys Ser Leu lie Leu Leu Ala lie Leu Ala Ala Leu Ala Val 
15 10 15 

Val Thr Leu Cys Tyr Glu Ser His Glu Ser Met Glu Ser Tyr Glu 

20 25 30 

Leu Arg Ser Glu Asn Ala Leu Ser Leu Ser Thr Ser Ser lie Gly 

35 40 45 

Lys Pro Val Met Thr Thr Asp Phe Ala Asn Ala Thr Pro Trp Phe 

50 55 60 

Met Asp Thr Met Leu Pro He He Ala Thr Ser Gly Ser Ala Glu 

65 70 75 

Gly Pro Asn Glu Thr Glu Gly Arg Lys Lys He Ser Phe Phe Leu 

80 85 90 

Glu Ala Gly Thr 



<210> 36 

<211> 52 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504738CD1 

<400> 36 

Met Lys Leu Cys Val Thr Val Leu Ser Leu Leu Met Leu Val Ala 
15 10 15 

Ala Phe Cys Ser Pro Ala Leu Ser Ala Pro Asn Ser Lys Pro Lys 

20 25 30 

Glu Ala Ser Lys Ser Val Leu He Pro Val Asn Pro Gly Ser Arg 

35 40 45 

Ser Thr Cys Met Thr Trp Asn 

50 

<210> 37 

<211> 33 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510280CD1 

<400> 37 

Met Asp Asn Val Gin Pro Lys He Lys His Arg Pro Phe Cys Phe 

15 10 15 

Ser Val Lys Gly His Val Lys Met Leu Arg Leu His Cys Leu Val 

20 25 30 

Val Phe Met 
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<210> 38 

<211> 242 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7503700CD1 

<400> 38 

Met Ser Pro Trp Ser Trp Phe Leu Leu Gin Thr Leu Cys Leu Leu 
15 10 15 

Pro thr Gly Ala Ala Ser Arg Arg Gly Ala Pro Gly Thr Ala Asn 
20 25 30 

Cys Glu Leu Lys Pro Gin Gin Ser Glu Leu Asn Ser Phe Leu Trp 
35 40 45 

Thr lie Lys Arg Asp Pro Pro Ser Tyr Phe Phe Gly Thr lie His 
50 55 60 

Val Pro Tyr Thr Arg Val Trp Asp Phe lie Pro Asp Asn Ser Lys 
65 70 75 

Glu Ala Phe Leu Gin Ser Ser lie Val Tyr Phe Glu Leu Asp Leu 
80 85 90 

Thr Asp Pro Tyr Thr lie Ser Ala Leu Thr Ser Cys Gin Met Leu 
95 100 105 

Pro Gin Gly Glu Asn Leu Gin Asp Val Leu Pro Arg Asp lie Tyr 
110 115 120 

Cys Arg Leu Lys Arg His Leu Glu Tyr Val Lys Leu Met Met Pro 
125 130 135 

Leu Trp Met Thr Pro Asp Gin Arg Gly Lys Gly Leu Tyr Ala Asp 
140 145 150 

Tyr Leu Phe Asn Ala lie Ala Gly Asn Trp Glu Arg Lys Arg Pro 
155 160 165 

Val Trp Val Met Leu Met Val Asn Ser Leu Thr Glu Val Asp lie 
170 175 180 

Lys Ser Arg Gly Val Pro Val Leu Asp Leu Phe Leu Ala Gin Glu 
185 190 195 

Ala Glu Arg Leu Arg Lys Gin Thr Gly Ala Val Glu Lys Val Glu 
200 205 210 

Glu Gin Cys His Pro Leu Asn Gly Leu Asn Phe Ser Gin Val lie 
215 220 225 

Ser Trp Ala Thr Thr Gin Cys Trp Met Phe Cys Gly Val Lys Ala 
230 235 240 

Met Arg 



<210> 39 

<211> 47 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504685CD1 

<400> 39 

Met Ala Ser Thr Val Val Ala Val Gly Leu Thr He Ala Ala Ala 
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15 10 15 

Gly Phe Ala Gly Arg Tyr Val Leu Gin Ala Met Lys His Met Glu 

20 25 30 

Pro Gin Val Lys Gin Val Phe Gin Ser Leu Pro Lys Ser Pro Tyr 

35 40 45 

Cys Gin 



<210> 40 

<211> 53 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7506844CD1 



<400> 40 

Met Ser Ser Phe Gly Tyr Arg Thr Leu Thr Val Ala Leu Phe Thr 
15 10 15 

Leu lie Cys Cys Pro Gly Val Asn Glu Phe Gin Arg Gin Arg Val 

20 25 30 

Pro Ala Ser Lys Ala Gly His Pro Asp Thr Ala Thr His Phe Gly 

35 40 45 

Gly Cys Gly Gin Val Leu His His 

50 



<210> 41 

<211> 95 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510259CD1 



<400> 41 

Met Met Tyr Ala Pro Val Glu Phe Ser Glu Ala Glu Phe Ser Arg 
15 10 15 

Ala Glu Tyr Gin Arg Lys Gin Gin Phe Trp Asp Ser Val Arg Leu 

20 25 30 

Ala Leu Phe Thr Leu Ala He Val Ala He He Gly He Ala He 

35 40 45 

Gly He Val Thr His Phe Val Val Glu Asp Asp Lys Ser Phe Tyr 

50 55 60 

Tyr Leu Ala Ser Phe Lys Val Thr Asn He Lys Tyr Lys Glu Asn 

65 70 75 

Tyr Gly He Arg Ser Ser Arg Glu Phe He Glu Arg Ser His Gin 

80 85 90 

He Glu Arg Met Arg 

95 



<210> 42 

<211> 57 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 7510444CD1 



<400> 42 

Met Val Gly Gin Arg Val Leu Leu Leu Val Ala Phe Leu Leu Ser 

15 10 15 

Gly Val Leu Leu Ser Glu Ala Ala Lys lie Leu Thr lie Ser Thr 

20 25 30 

Leu Gly Glu Cys Leu Ala Gly Glu Phe Pro Asp Arg Arg Val Pro 

35 40 45 
Asp Pro Arg Thr Ala Arg Ala Pro Ala Asn Gly Asp 

50 55 



<210> 43 

<211> 67 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510494CD1 

<400> 43 

Met Pro Gly Arg Ser Cys Val Ala 

1 5 
Val Ser Cys Ala Val Ala Gin His 
20 

Glu Pro Arg Arg Arg Glu Gly Arg 
35 

Lys lie Asn Leu Ser Ser Phe Arg 
50 

Ser Met Val His His Lys Ser 
65 



Leu Val Leu Leu Ala Ala Ala 

10 15 
Ala Pro Pro Val Ser Glu Leu 

25 30 
Ala Val Asp Arg Gly Leu Gin 

40 45 
Thr Asn Val Gin Arg Cys Ser 

55 60 



<210> 44 

<211> 311 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 6486485CD1 



<400> 44 

Met Ala Phe Leu Val Ala His Pro Met Gin Phe Val Tyr Leu Leu 
15 10 15 

Thr Thr Leu Cys Val Phe Asn Met Val Phe Ala Lys Leu Gly Phe 

20 25 30 

Ser Glu Thr Val Phe Ser Gin Arg Leu Ser Phe Thr Val Leu Ser 

35 40 45 

Ala Val Gly Tyr Phe Gin Trp Gin Lys Arg Pro His Leu Leu Pro 

50 55 60 

Val Gly Pro Leu Gly Arg Ser Met Arg Trp Cys Leu Leu Leu lie 

65 70 75 

Trp Ala Gin Gly Leu Arg Gin Ala Pro Leu Ala Ser Gly Met Met 
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80 






85 




90 


Thr 


Gly Thr lie 


Glu 


Thr 


Thr 


Gly Asn He 


Ser Ala Glu Lys 


Gly 






95 






100 




105 


Gly 


Ser He He 


Leu 


Gin 


Cys 


His Leu Ser 


Ser Thr Thr Ala 


Gin 






110 






115 




120 


Val 


Thr Gin Val 


Asn 


Trp 


Glu 


Gin Gin Asp 


Gin Leu Leu Ala 


He 






125 






130 




135 


Cys 


Asn Ala Asp 


Leu 


Gly 


Trp 


His He Ser 


Pro Ser Phe Lys 


Asp 






140 






145 




150 


Arg 


Val Ala Pro 


Gly 


Pro 


Gly 


Leu Gly Leu 


Thr Leu Gin Ser 


Leu 






155 






160 




165 


Thr 


Val Asn Asp 


Thr 


Gly 


Glu 


Tyr Phe Cys 


He Tyr His Thr 


Tyr 






170 






175 




180 


Pro 


Asp Gly Thr 


Tyr 


Thr 


Gly 


Arg He Phe 


Leu Glu Val Leu 


Glu 






185 






190 




195 


Ser 


Ser Val Ala 


Glu 


His 


Gly 


Ala Arg Phe 


Gin He Pro Leu 


Leu 






200 






205 




210 


Gly 


Ala Met Ala 


Ala 


Thr 


Leu 


Val Val He 


Cys Thr Ala Val 


He 






215 






220 




225 


Val 


Val Val Ala 


Leu 


Thr 


Arg 


Lys Lys Lys 


Ala Leu Arg lie 


His 






230 






235 




240 


Ser 


Val Glu Gly 


Asp 


Leu 


Arg 


Arg Lys Ser 


Ala Gly Gin Glu 


Glu 






245 






250 




255 


Trp 


Ser Pro Ser 


Ala 


Pro 


Ser 


Pro Pro Gly 


Ser Cys Val Gin 


Ala 






260 






265 




270 


Glu 


Ala Ala Pro 


Ala 


Gly 


Leu 


Cys Gly Glu 


Gin Arg Gly Glu 


Asp 






275 






280 




285 


cys 


Ala Glu Leu 


His 


Asp 


Tyr 


Phe Asn Val 


Leu Ser Tyr Arg 


Ser 






290 






295 




300 


Leu 


Gly Asn Cys 


Ser 


Phe 


Phe 


Thr Glu Thr 


Gly 








305 






310 







<210> 45 

<211> 1097 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 7503772CD1 

<400> 45 



Met 


Gly 


Ala 


Ser 


Arg 


Asp 


Arg 


Gly Leu Ala 


Ala Leu Trp Cys 


Leu 


1 








5 






10 




15 


Gly 


Leu 


Leu 


Gly 


Gly 


Leu 


Ala 


Arg Val Ala 


Gly Thr His Tyr 


Arg 










20 






25 




30 


Tyr 


Leu 


Trp 


Arg 


Gly 


Cys 


Tyr 


Pro Cys His 


Leu Gly Gin Ala 


Gly 










35 






40 




45 


Tyr 


Pro 


Val 


Ser 


Ala 


Gly 


Asp 


Gin Arg Pro 


Asp Val Asp Glu 


Cys 










50 






55 




60 


Arg 


Thr 


His 


Asn 


Gly 


Gly 


Cys 


Gin His Arg 


Cys Val Asn Thr 


Pro 










65 






70 




75 


Gly 


Ser 


Tyr 


Leu 


Cys 


Glu 


Cys 


Lys Pro Gly 


Phe Arg Leu His 


Thr 










80 






85 




90 


Asp 


Ser 


Arg 


Thr 


Cys 


Leu 


Ala 


He Asn Ser 


Cys Ala Leu Gly 


Asn 










95 






100 




105 
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Gly Gly Cys Gin His His Cys Val Gin Leu Thr lie Thr Arg His 

110 115 120 

Arg Cys Gin Cys Arg Pro Gly Phe Gin Leu Gin Glu Asp Gly Arg 

125 130 135 

His Cys Val Arg Arg Ser Pro Cys Ala Asn Arg Asn Gly Ser Cys 

140 145 150 

Met His Arg Cys Gin Val Val Arg Gly Leu Ala Arg Cys Glu Cys 

155 160 165 

His Val Gly Tyr Gin Leu Ala Ala Asp Gly Lys Ala Cys Glu Asp 

170 175 180 

Val Asp Glu Cys Ala Ala Gly Leu Ala Gin Cys Ala His Gly Cys 

185 190 195 

Leu Asn Thr Gin Gly Ser Phe Lys Cys Val Cys His Ala Gly Tyr 

200 205 210 

Glu Leu Gly Ala Asp Gly Arg Gin Cys Tyr Arg lie Glu Met Glu 

215 220 225 

lie Val Asn Ser Cys Glu Ala Asn Asn Gly Gly Cys Ser His Gly 

230 235 240 

Cys Ser His Thr Ser Ala Gly Pro Leu Cys Thr Cys Pro Arg Gly 

245 250 255 

Tyr Glu Leu Asp Thr Asp Gin Arg Thr Cys lie Asp Val Asp Asp 

260 265 270 

Cys Ala Asp Ser Pro Cys Cys Gin Gin Val Cys Thr Asn Asn Pro 

275 280 285 
Gly Gly Tyr Glu Cys Gly Cys Tyr Ala Gly Tyr Arg Leu Ser Ala 

290 295 300 

Asp Gly Cys Gly Cys Glu Asp Val Asp Glu Cys Ala Ser Ser Arg 

305 310 315 

Gly Gly Cys Glu His His Cys Thr Asn Leu Ala Gly Ser Phe Gin 

320 325 330 

Cys Ser Cys Glu Ala Gly Tyr Arg Leu His Glu Asp Arg Arg Gly 

335 340 345 

Cys Ser Pro Leu Glu Glu Pro Met Val Asp Leu Asp Gly Glu Leu 

350 355 360 

Pro Phe Val Arg Pro Leu Pro His lie Ala Val Leu Gin Asp Glu 

365 370 375 
Leu Pro Gin Leu Phe Gin Asp Asp Asp Val Gly Ala Asp Glu Glu 

380 385 390 
Glu Ala Glu Leu Arg Gly Glu His Thr Leu Thr Glu Lys Phe Val 

395 400 405 
Cys Leu Asp Asp Ser Phe Gly His Asp Cys Ser Leu Thr Cys Asp 

410 415 420 
Asp Cys Arg Asn Gly Gly Thr Cys Leu Leu Gly Leu Asp Gly Cys 

425 430 435 

Asp Cys Pro Glu Gly Trp Thr Gly Leu lie Cys Asn Glu Thr Cys 

440 445 450 
Pro Pro Asp Thr Phe Gly Lys Asn Cys Ser Phe Ser Cys Ser Cys 

455 460 465 
Gin Asn Gly Gly Thr Cys Asp Ser Val Thr Gly Ala Cys Arg Cys 

470 475 480 
Pro Pro Gly Val Ser Gly Thr Asn Cys Glu Asp Gly Cys Pro Lys 

485 490 495 

Gly Tyr Tyr Gly Lys His Cys Arg Lys Lys Cys Asn Cys Ala Asn 

500 505 510 

Arg Gly Arg Cys His Arg Leu Tyr Gly Ala Cys Leu Cys Asp Pro 

515 520 525 
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Gly Leu Tyr 


Gly 


Arg 
530 


Phe 


Cys 


His 


Leu 


Thr 
535 


Cys 


Pro 


Pro 


Trp 


Ala 
540 


Phe Gly Pro 


Gly 


Cys 
545 


Ser 


Glu 


Glu 


Cys 


Gin 
550 


Cys 


Val 


Gin 


Pro 


His 
555 


Thr Gin Ser 


Cys 


Asp 
560 


Lys 


Arg 


Asp 


Gly 


Ser 
565 


Cys 


Ser 


Cys 


Lys 


Ala 
570 


Gly Phe Arg 


Gly 


Glu 
575 


Arg 


Cys 


Gin 


Ala 


Glu 
580 


Cys 


Glu 


Leu 


Gly 


Tyr 
585 


Phe Gly Pro 


Gly 


Cys 
590 


Trp 


Gin 


Ala 


Cys 


Thr 
595 


Cys 


Pro 


Val 


Gly 


Val 
600 


Ala Cys Asp 


Ser 


Val 
605 


Ser 


Gly 


Glu 


Cys 


Gly 
610 


Lys 


Arg 


Cys 


Pro 


Ala 
615 


Gly Phe Gin 


Gly 


Glu 
620 


Asp 


Cys 


Gly 


Gin 


Glu 
625 


Cys 


Pro 


Val 


Gly 


Thr 
630 


Phe Gly Val 


Asn 


Cys 
635 


Ser 


Ser 


Ser 


Cys 


Ser 
640 


Cys 


Gly 


Gly 


Ala 


Pro 
645 


Cys His Gly 


Val 


Thr 
650 


Gly 


Gin 


Cys 


Arg 


Cys 
655 


Pro 


Pro 


Gly 


Arg 


Thr 
660 


Gly Glu Asp 


Cys 


Glu 
665 


Ala 


Asp 


Cys 


Pro 


Glu 
670 


Gly 


Arg 


Trp 


Gly 


Leu 
675 


Gly Cys Gin 


Glu 


He 
680 


Cys 


Pro 


Ala 


Cys 


Gin 
685 


His 


Ala 


Ala 


Arg 


Cys 
690 


Asp Pro Glu 


Thr 


Gly 
695 


Ala 


Cys 


Leu 


Cys 


Leu 
700 


Pro 


Gly 


Phe 


Val 


Gly 
705 


Ser Arg Cys 


Gin 


Asp 
710 


Val 


Cys 


Pro 


Ala 


Gly 
715 


Trp 


Tyr 


Gly 


Pro 


Ser 
720 


Cys Gin Thr 


Arg 


Cys 
725 


Ser 


Cys 


Ala 


Asn 


Asp 
730 


Gly 


His 


Cys 


His 


Pro 

735 


Ala Thr Gly 


His 


Cys 
740 


Ser 


Cys 


Ala 


Pro 


Gly 
745 


Trp 


Thr 


Gly 


Phe 


Ser 
750 


Cys Gin Arg 


Ala 


Cys 
755 


Asp 


Thr 


Gly 


His 


Trp 
760 


Gly 


Pro 


Asp 


Cys 


Ser 
765 


His Pro Cys 


Asn 


Cys 
770 


Ser 


Ala 


Gly 


His 


Gly 
775 


Ser 


Cys 


Asp 


Ala 


He 
780 


Ser Gly Leu 


Cys 


Leu 
785 


Cys 


Glu 


Ala 


Gly 


Tyr 
790 


Val 


Gly 


Pro 


Arg 


Cys 
795 


Glu Gin Gin 


Cys 


Pro 
800 


Gin 


Gly 


His 


Phe 


Gly 
805 


Pro 


Gly 


Cys 


Glu 


Gin 
810 


Leu Cys Gin 


Cys 


Gin 
815 


His 


Gly 


Ala 


Ala 


Cys 
820 


Asp 


His 


Val 


Ser 


Gly 
825 


Ala Cys Thr 


Cys 


Pro 
830 


Ala 


Gly 


Trp 


Arg 


Gly 
835 


Thr 


Phe 


Cys 


Glu 


His 
840 


Ala Cys Pro 


Ala 


Gly 
845 


Phe 


Phe 


Gly 


Leu 


Asp 
850 


Cys 


Arg 


Ser 


Ala 


Cys 
855 


Asn Cys Thr 


Ala 


Gly 
860 


Ala 


Ala 


Cys 


Asp 


Ala 
865 


Val 


Asn 


Gly 


Ser 


cys 
870 


Leu Cys Pro 


Ala 


Gly 
875 


Arg 


Arg 


Gly 


Pro 


Arg 
880 


Cys 


Ala 


Glu 


Thr 


Cys 
885 


Pro Ala His 


Thr 


Tyr 
890 


Gly 


His 


Asn 


Cys 


Ser 
895 


Gin 


Ala 


Cys 


Ala 


Cys 
900 


Phe Asn Gly 


Ala 


Ser 
905 


cys 


Asp 


Pro 


Val 


His 
910 


Gly 


Gin 


Cys 


His 


Cys 
915 


Ala Pro Gly 


Trp 


Met 
920 


Gly 


Pro 


Ser 


Cys 


Leu 
925 


Gin 


Glu 


Cys 


Leu 


Pro 
930 


Arg Asp Val 


Arg 


Ala 

935 


Gly 


Cys 


Arg 


His 


Ser 
940 


Gly 


Gly 


Cys 


Leu 


Asn 
945 
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Gly Gly Leu Cys Asp Pro His Thr Gly Arg Cys Leu Cys Pro Ala 
950 955 960 

Gly Trp Thr Gly Asp Lys Cys Gin Ser Pro Cys Leu Arg Gly Trp 
965 970 975 

Phe Gly Glu Ala Cys Ala Gin Arg Cys Ser Cys Pro Pro Gly Ala 
980 985 990 

Ala Cys His His Val Thr Gly Ala Cys Arg Cys Pro Pro Gly Phe 
995 1000 1005 

Thr Gly Ser Gly Cys Glu Gin Gly Cys Pro Pro Gly Arg Tyr Gly 

1010 1015 1020 

Pro Gly Cys Glu Gin Leu Cys Gly Cys Leu Asn Gly Gly Ser Cys 

1025 1030 1035 

Asp Ala Ala Thr Gly Ala Cys Arg Cys Pro Thr Gly Phe Leu Gly 

1040 1045 1050 

Thr Asp Cys Asn Leu Ser Glu Trp Leu Val Ala Ala Val Leu Ser 

1055 1060 1065 

Gly Ala Ser Cys Val Ser Arg Pro Val Arg Arg Ala Ala Ser Ala 

1070 1075 1080 

Pro Thr Ala Pro Thr Cys Val Gly Val Gly Arg Gly Arg Pro Ala 

1085 1090 1095 

Thr Leu 



<210> 46 

<211> 1350 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7503773CD1 

<400> 46 



Met 


Gly 


Ala 


Ser 


Arg 


Asp 


Arg 


Gly 


Leu 


Ala 


Ala Leu Trp 


Cys 


Leu 


1 








5 










10 






15 


Gly 


Leu 


Leu 


Gly 


Gly 
20 


Leu 


Ala 


Arg 


Val 


Ala 
25 


Gly Thr His 


Tyr 


Arg 
30 


Tyr 


Leu 


Trp 


Arg 


Gly 
35 


Cys 


Tyr 


Pro 


Cys 


His 
40 


Leu Gly Gin 


Ala 


Gly 
45 


Tyr 


Pro 


Val 


Ser 


Ala 
50 


Gly 


Asp 


Gin 


Arg 


Pro 
55 


Asp Val Asp 


Glu 


Cys 
60 


Arg 


Thr 


His 


Asn 


Gly 
65 


Gly 


Cys 


Gin 


His 


Arg 
70 


Cys Val Asn 


Thr 


Pro 
75 


Gly 


Ser 


Tyr 


Leu 


Cys 
80 


Glu 


Cys 


Lys 


Pro 


Gly 
85 


Phe Arg Leu 


His 


Thr 
90 


Asp 


Ser 


Arg 


Thr 


Cys 
95 


Leu 


Ala 


lie 


Asn 


Ser 
100 


Cys Ala Leu 


Gly 


Asn 
105 


Gly 


Gly 


Cys 


Gin 


His 
110 


His 


Cys 


Val 


Gin 


Leu 
115 


Thr lie Thr 


Arg 


His 
120 


Arg 


Cys 


Gin 


Cys 


Arg 
125 


Pro 


Gly 


Phe 


Gin 


Leu 
130 


Gin Glu Asp 


Gly 


Arg 
135 


His 


Cys 


Val 


Arg 


Arg 
140 


Ser 


Pro 


Cys 


Ala 


Asn 
145 


Arg Asn Gly 


Ser 


Cys 
150 


Met 


His 


Arg 


Cys 


Gin 
155 


Val 


Val 


Arg 


Gly 


Leu 
160 


Ala Arg Cys 


Glu 


Cys 
165 


His 


Val 


Gly 


Tyr 


Gin 


Leu 


Ala 


Ala 


Asp 


Gly 


Lys Ala Cys 


Glu 


Asp 
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170 175 180 

Val Asp Glu Cys Ala Ala Gly Leu Ala Gin Cys Ala His Gly Cys 

185 190 195 

Leu Asn Thr Gin Gly Ser Phe Lys Cys Val Cys His Ala Gly Tyr 

200 205 210 

Glu Leu Gly Ala Asp Gly Arg Gin Cys Tyr Arg lie Glu Met Glu 

215 220 225 

lie Val Asn Ser Cys Glu Ala Asn Asn Gly Gly Cys Ser His Gly 

230 235 240 

Cys Ser His Thr Ser Ala Gly Pro Leu Cys Thr Cys Pro Arg Gly 

245 250 255 

Tyr Glu Leu Asp Thr Asp Gin Arg Thr Cys He Asp Val Asp Asp 

260 265 270 

Cys Ala Asp Ser Pro Cys Cys Gin Gin Val Cys Thr Asn Asn Pro 

275 280 285 

Gly Gly Tyr Glu Cys Gly Cys Tyr Ala Gly Tyr Arg Leu Ser Ala 

290 295 300 

Asp Gly Cys Gly Cys Glu Asp Val Asp Glu Cys Ala Ser Ser Arg 

305 310 315 

Gly Gly Cys Glu His His Cys Thr Asn Leu Ala Gly Ser Phe Gin 

320 325 330 

Cys Ser Cys Glu Ala Gly Tyr Arg Leu His Glu Asp Arg Arg Gly 

335 340 345 

Cys Ser Pro Leu Glu Glu Pro Met Val Asp Leu Asp Gly Glu Leu 

350 355 360 

Pro Phe Val Arg Pro Leu Pro His He Ala Val Leu Gin Asp Glu 

365 370 375 

Leu Pro Gin Leu Phe Gin Asp Asp Asp Val Gly Ala Asp Glu Glu 

380 385 390 

Glu Ala Glu Leu Arg Gly Glu His Thr Leu Thr Glu Lys Phe Val 

395 400 405 

Cys Leu Asp Asp Ser Phe Gly His Asp Cys Ser Leu Thr Cys Asp 

410 415 420 

Asp Cys Arg Asn Gly Gly Thr Cys Leu Leu Gly Leu Asp Gly Cys 

425 430 435 

Asp Cys Pro Glu Gly Trp Thr Gly Leu He Cys Asn Glu Thr Cys 

440 445 450 

Pro Pro Asp Thr Phe Gly Lys Asn Cys Ser Phe Ser Cys Ser Cys 

455 460 465 

Gin Asn Gly" Gly Thr Cys Asp Ser Val Thr Gly Ala Cys Arg Cys 

470 475 480 

Pro Pro Gly Val Ser Gly Thr Asn Cys Glu Asp Gly Cys Pro Lys 

485 490 495 

Gly Tyr Tyr Gly Lys His Cys Arg Lys Lys Cys Asn Cys Ala Asn 

500 505 510 

Arg Gly Arg Cys His Arg Leu Tyr Gly Ala Cys Leu Cys Asp Pro 

515 520 525 

Gly Leu Tyr Gly Arg Phe Cys His Leu Thr Cys Pro Pro Trp Ala 

530 535 540 

Phe Gly Pro Gly Cys Ser Glu Glu Cys Gin Cys Val Gin Pro His 

545 550 555 

Thr Gin Ser Cys Asp Lys Arg Asp Gly Ser Cys Ser Cys Lys Ala 

560 565 570 

Gly Phe Arg Gly Glu Arg Cys Gin Ala Glu Cys Glu Leu Gly Tyr 

575 580 585 

Phe Gly Pro Gly Cys Trp Gin Ala Cys Thr Cys Pro Val Gly Val 
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590 




595 






600 


Ala Cys 


Asp 


Ser 


Val 


Ser Gly Glu Cys 


Gly 


Lys Arg 


Cys Pro 


Ala 








605 




610 






615 


Gly Phe 


Gin 


Gly 


Glu 


Asp Cys Gly Gin 


Glu 


Cys Pro 


Val Gly 


Thr 








620 




625 






630 


Phe Gly 


Val 


Asn 


Cys 


Ser Ser Ser Cys 


Ser 


Cys Gly 


Gly Ala 


Pro 








635 




640 






645 


Cys His 


Gly 


Val 


Thr 


Gly Gin Cys Arg 


Cys 


Pro Pro 


Gly Arg 


Thr 








650 




655 






660 


Gly Glu 


Asp 


Cys 


Glu 


Ala Asp Cys Pro 


Glu 


Gly Arg 


Trp Gly 


Leu 








665 




670 






675 


Gly Cys 


Gin 


Glu 


He 


Cys Pro Ala Cys 


Gin 


His Ala 


Ala Arg 


Cys 








680 




685 






690 


Asp Pro 


Glu 


Thr 


Gly 


Ala Cys Leu Cys 


Leu 


Pro Gly 


Phe Val 


Gly 








695 




700 






705 


Ser Arg 


Cys 


Gin 


Asp 


Val Cys Pro Ala 


Gly 


Trp Tyr 


Gly Pro 


Ser 








710 




715 






720 


Cys Gin 


Thr 


Arg 


Cys 


Ser Cys Ala Asn 


Asp 


Gly His 


Cys His 


Pro 








725 




730 






735 


Ala Thr 


Gly 


His 


Cys 


Ser Cys Ala Pro 


Gly 


Trp Thr 


Gly Phe 


Ser 








740 




745 






750 


Cys Gin 


Arg 


Ala 


Cys 


Asp Thr Gly His 


Trp 


Gly Pro 


Asp Cys 


Ser 








755 




760 






765 


His Pro 


Cys 


Asn 


Cys 


Ser Ala Gly His 


Gly 


Ser Cys 


Asp Ala 


He 








770 




775 






780 


Ser Gly 


Leu 


Cys 


Leu 


Cys Glu Ala Gly 


Tyr 


Val Gly 


Pro Arg 


Cys 








785 




790 






795 


Glu Gin 


Gin 


Cys 


Pro 


Gin Gly His Phe 


Gly 


Pro Gly 


Cys Glu 


Gin 








800 




805 






810 


Leu Cys 


Gin 


Cys 


Gin 


His Gly Ala Ala 


Cys 


Asp His 


Val Ser 


Gly 








815 




820 






825 


Ala Cys 


Thr 


Cys 


Pro 


Ala Gly Trp Arg 


Gly 


Thr Phe 


Cys Glu 


His 








830 




835 






840 


Ala Cys 


Pro 


Ala 


Gly 


Phe Phe Gly Leu 


Asp 


Cys Arg 


Ser Ala 


Cys 








845 




850 






855 


Asn Cys 


Thr 


Ala 


Gly 


Ala Ala Cys Asp 


Ala 


Val Asn 


Gly Ser 


Cys 








860 




865 






870 


Leu Cys 


Pro 


Ala 


Gly 


Arg Arg Gly Pro 


Arg 


Cys Ala 


Glu Thr 


Cys 








875 




880 






885 


Pro Ala 


His 


Thr 


Tyr 


Gly His Asn Cys 


Ser 


Gin Ala 


Cys Ala 


Cys 








890 




895 






900 


Phe Asn 


Gly 


Ala 


Ser 


Cys Asp Pro Val 


His 


Gly Gin 


Cys His 


Cys 








905 




910 






915 


Ala Pro 


Gly 


Trp 


Met 


Gly Pro Ser Cys 


Leu 


Gin Glu 


Cys Leu 


Pro 








920 




925 






930 


Arg Asp 


Val 


Arg 


Ala 


Gly Cys Arg His 


Ser 


Gly Gly 


Cys Leu 


Asn 








935 




940 






945 


Gly Gly 


Leu 


Cys 


Asp 


Pro His Thr Gly 


Arg 


Cys Leu 


Cys Pro 


Ala 








950 




955 






960 


Gly Trp 


Thr 


Gly 


Asp 


Lys Cys Gin Ser 


Pro 


Cys Leu 


Arg Gly 


Trp 








965 




970 






975 


Phe Gly 


Glu 


Ala 


Cys 


Ala Gin Arg Cys 


Ser 


Cys Pro 


Pro Gly 


Ala 








980 




985 






990 


Ala Cys 


His 


His 


Val 


Thr Gly Ala Cys 


Arg 


Cys Pro 


Pro Gly 


Phe 








995 


1000 




1005 


Thr Gly 


Ser 


Gly 


Cys 


Glu Gin Ala Cys 


Pro 


Pro Gly 


Ser Phe Gly 
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1010 




1015 




1020 


Glu 


Asp Cys 


Ala Gin 


Met 


Cys Gin Cys Pro 


Gly Glu Asn 


Pro Ala 






1025 




1030 




1035 


Cys 


His Pro 


Ala Thr 


Gly 


Thr Cys Ser Cys 


Ala Ala Gly 


Tyr His 






1040 




1045 




1050 


Gly 


Pro Ser 


Cys Gin 


Gin 


Arg Cys Pro Pro 


Gly Arg Tyr 


Gly Pro 






1055 




1060 




1065 


Gly 


Cys Glu 


Gin Leu 


Cys 


Gly Cys Leu Asn 


Gly Gly Ser 


Cys Asp 






1070 




1075 




1080 


Ala 


Ala Thr 


Gly Ala 


Cys 


Arg Cys Pro Thr 


Gly Phe Leu 


Gly Thr 






1085 




1090 




1095 


Asp 


Cys Asn 


Leu Thr 


Cys 


Pro Gin Gly Arg 


Phe Gly Pro 


Asn Cys 






1100 




1105 




1110 


Thr 


His Val 


Cys Gly 


Cys 


Gly Gin Gly Ala 


Ala Cys Asp 


Pro Val 






1115 




1120 




1125 


Thr 


Gly Thr 


Cys Leu 


Cys 


Pro Pro Gly Arg 


Ala Gly Val 


Arg Cys 






1130 




1135 




1140 


Glu 


Arg Gly 


Cys Pro 


Gin 


Asn Arg Phe Gly 


Val Gly Cys 


Glu His 






1145 




1150 




1155 


Thr 


Cys Ser 


Cys Arg 


Asn 


Gly Gly Leu Cys 


His Ala Ser 


Asn Gly 






1160 




1165 




1170 


Ser 


Cys Ser 


Cys Gly 


Leu 


Gly Trp Thr Gly 


Arg His Cys 


Glu Leu 






1175 




1180 




1185 


Ala 


Cys Pro 


Pro Gly 


Arg 


Tyr Gly Ala Ala 


Cys His Leu 


Glu Cys 






1190 




1195 




1200 


Ser 


Cys His 


Asn Asn 


Ser 


Thr Cys Glu Pro 


Ala Thr Gly 


Thr Cys 






1205 




1210 




1215 


Arg 


Cys Gly 


Pro Gly 


Phe 


Tyr Gly Gin Ala 


Cys Glu His 


Pro Cys 






1220 




1225 




1230 


Pro 


Pro Gly 


Phe His 


Gly 


Ala Gly Cys Gin 


Gly Leu Cys 


Trp Cys 






1235 




1240 




1245 


Gin 


His Gly 


Ala Pro 


Cys 


Asp Pro lie Ser 


Gly Arg Cys 


Leu Cys 






1250 




1255 




1260 


Pro 


Ala Gly 


Phe His 


Gly 


His Phe Cys Glu 


Arg Gly Cys 


Glu Pro 






1265 




1270 




1275 


Gly 


Ser Phe 


Gly Glu 


Gly 


Cys His Gin Arg 


Cys Asp Cys 


Asp Gly 






1280 




1285 




1290 


Gly 


Ala Pro 


Cys Asp 


Pro 


Val Thr Gly Leu 


Cys Leu Cys 


Pro Pro 






1295 




1300 




1305 


Gly 


Arg Ser 


Gly Ala 


Thr 


Cys Asn Leu Gly 


Gly Pro Leu 


Arg Leu 






1310 




1315 




1320 


Pro 


Glu Asn 


Pro Ser 


Leu 


Ala Gin Gly Ser 


Ala Gly Thr 


Leu Pro 






1325 




1330 




1335 


Ala 


Ser Ser 


Arg Pro 


Thr 


Ser Arg Ser Gly 


Gly Pro Ala 


Arg His 






1340 




1345 




1350 



<210> 47 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504698CD1 
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<400> 47 

Met Glu Val Pro Pro Pro Ala Pro Arg Ser Phe Leu Cys Arg Ala 
15 10 15 

Leu Cys Leu Phe Pro Arg Val Phe Ala Ala Glu Ala Val Thr Ala 

20 25 30 

Asp Ser Glu Val Leu Glu Glu Arg Gin Lys Arg Leu Pro Tyr Val 

35 40 45 

Pro Glu Pro Tyr Tyr Pro Glu Ser Gly Trp Asp Arg Leu Arg Glu 

50 55 60 

Leu Phe Gly Lys Asp Cys His Gly Lys Ser Phe 



<210> 48 

<211> 220 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510361CD1 

<400> 48 

Met Leu Gin Leu Trp Lys Leu Val Leu Leu Cys Gly Val Leu Thr 

15 10 15 

Gly Thr Ser Glu Ser Leu Leu Asp Asn Leu Gly Asn Asp Leu Ser 

20 25 30 

Asn Val Val Asp Lys Leu Glu Pro Val Leu His Glu Gly Leu Glu 

35 40 45 

Thr Val Asp Asn Thr Leu Lys Gly lie Leu Glu Lys Leu Lys Val 

50 55 60 

Asp Leu Gly Val Leu Gin Lys Ser Ser Ala Trp Gin Leu Ala Lys 

65 70 75 

Gin Lys Ala Gin Glu Ala Glu Lys Leu Leu Asn Asn Val lie Ser 

80 85 90 
Lys Leu Leu Pro Thr Asn Thr Asp lie Phe Gly Leu Lys He Ser 

95 100 105 

Asn Ser Leu He Leu Asp Val Lys Ala Glu Pro He Asp Asp Gly 

110 115 120 

Lys Gly Leu Asn Leu Ser Phe Pro Val Thr Ala Asn Val Thr Val 

125 130 135 

Ala Gly Pro He He Gly Gin He He Asn Leu Lys Ala Ser Leu 

140 145 150 
Asp Leu Leu Thr Ala Val Thr He Glu Thr Asp Pro Gin Thr His 

155 160 165 
Gin Pro Val Ala Val Leu Gly Glu Cys Ala Ser Asp Pro Thr Ser 

170 175 180 

He Ser Leu Ser Leu Leu Asp Asn Gin Lys Cys He Glu Ala Gly 

185 190 195 
His Asp Gly Ser Thr Pro Val He Pro Ala Leu Trp Glu Ala Glu 

200 205 210 
Thr Gin Pro Asn His Gin Gin Val Arg Glu 



65 



70 



215 



220 



<210> 49 
<211> 39 
<212> PRT 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7507013CD1 

<400> 49 

Met Arg Gly His Pro Ser Leu Leu Leu Leu Tyr Met Ala Leu Thr 
15 10 15 

Thr Cys Leu Asp Thr Ser Pro Ser Glu Glu Thr Asp Gin Glu Val 

20 25 30 

Phe Leu Gly Arg Gly Gin Gly Val Phe 

35 

<210> 50 

<211> 451 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7510507CD1 

<400> 50 

Met Ala Ser Ala Thr Glu Asp Pro Val Leu Glu Arg Tyr Phe Lys 
15 10 15 

Gly His Lys Ala Ala lie Thr Ser Leu Asp Leu Ser Pro Asn Gly 
20 25 30 

Lys Gin Leu Ala Thr Ala Ser Trp Asp Thr Phe Leu Met Leu Trp 
35 40 45 

Asn Phe Lys Pro His Ala Arg Ala Tyr Arg Tyr Val Gly His Lys 
50 55 60 

Asp Val Val Thr Ser Val Gin Phe Ser Pro His Gly Asn Leu Leu 
65 70 75 

Ala Ser Ala Ser Arg Asp Arg Thr Val Arg Leu Trp lie Pro Asp 
80 85 90 

Lys Arg Gly Lys Phe Ser Glu Phe Lys Ala His Thr Ala Pro Val 
95 100 105 

Arg Ser Val Asp Phe Ser Ala Asp Gly Gin Phe Leu Ala Thr Ala 
110 115 120 

Ser Glu Asp Lys Ser lie Lys Val Trp Ser Met Tyr Arg Gin Arg 
125 130 135 

Phe Leu Tyr Ser Leu Tyr Arg His Thr His Trp Val Arg Cys Ala 
140 145 150 

Lys Phe Ser Pro Asp Gly Arg Leu lie Val Ser Cys Ser Glu Asp 
155 160 165 

Lys Thr lie Lys lie Trp Asp Thr Thr Asn Lys Gin Cys Val Asn 
170 175 180 

Asn Phe Ser Asp Phe Val Gly Phe Ala Asn Phe Val Asp Phe Asn 
185 190 195 

Pro Ser Gly Thr Cys lie Ala Ser Ala Gly Ser Asp Gin Thr Val 
200 205 210 

Lys Val Trp Asp Val Arg Val Asn Lys Leu Leu Gin His Tyr Gin 
215 220 225 

Val His Ser Gly Gly Val Asn Cys lie Ser Phe His Pro Ser Gly 
230 235 240 
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Asn Tyr 
Asp Leu 
Gly Pro 
Ala Ser 
Phe Asp 
Arg Leu 
Arg Thr 
Thr Glu 
Cys Gly 
Pro Thr 
Cys Glu 
Glu His 
lie Leu 
Cys Leu 
Ser 



Leu lie Thr 
245 

Leu Glu Gly 
260 

Val Phe Thr 
275 

Gly Gly Ala 
290 

Glu Leu His 
305 

His Phe Asp 
320 

Pro His Pro 
335 

Thr Ser Gly 
350 

Tyr Phe Leu 
365 

Thr Thr Lys 
380 

Ser Gin Arg 
395 

He Met Glu 
410 

Glu Gin Arg 
425 

Glu Asn Gin 
440 



Ala Ser Ser 
Arg Leu He 
Val Ser Phe 
Asp Thr Gin 
Cys Lys Gly 
Ser Pro Pro 
His Glu Glu 
Arg Thr Leu 
Asn Pro Ser 
Lys Lys Thr 
Ser He Pro 
Gin Leu Asn 
Leu Thr Leu 
Gin Lys Leu 



Asp Gly Thr 

250 
Tyr Thr Leu 

265 
Ser Lys Gly 

280 
Val Leu Leu 

295 
Leu Thr Lys 

310 
His Leu Leu 

325 
Lys Val Glu 

340 
Pro Asp Lys 

355 
Leu Met Ser 

370 
Glu Asp Met 

385 
Leu Ala Val 

400 
Val Leu Thr 

415 
Thr Glu Asp 

430 
Phe Ser Ala 

445 



Leu Lys 
Gin Gly 
Gly Glu 
Trp Arg 
Arg Asn 
Asp He 
Thr Val 
Gly Glu 
Pro Glu 
Ser Asp 
Thr Asp 
Gin Thr 
Lys Leu 
Val Gin 



He Leu 

255 
His Thr 

270 
Leu Phe 

285 
Thr Asn 

300 
Leu Lys 

315 
Tyr Pro 

330 
Glu Thr 

345 
Glu Ala 

360 
Cys Leu 

375 
Leu Pro 

390 
Ala Leu 

405 
Val Ser 

420 
Lys Asp 

435 
Gin Lys 

450 



<210> 51 

<211> 224 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 90106370CD1 

<400> 51 

Met Pro Leu Thr Pro Glu Pro Pro Ser Gly Arg Val Glu Gly Pro 
15 10 15 

Pro Ala Trp Glu Ala Ala Pro Trp Pro Ser Leu Pro Cys Gly Pro 

20 25 30 

Cys He Pro He Met Leu Val Leu Ala Thr Leu Ala Ala Leu Phe 

35 40 45 

He Leu Thr Thr Ala Val Leu Ala Glu Arg Leu Phe Arg Arg Ala 

50 55 60 

Leu Arg Pro Asp Pro Ser His Arg Ala Pro Thr Leu Val Trp Arg 

65 70 75 

Pro Gly Gly Glu Leu Trp He Glu Pro Met Gly Thr Ala Arg Glu 

80 85 90 

Arg Ser Glu Asp Trp Tyr Gly Ser Ala Val Pro Leu Leu Thr Asp 

95 100 105 

Arg Ala Pro Glu Pro Pro Thr Gin Val Gly Thr Leu Glu Ala Arg 
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Ala Thr 
Asn Leu 
Phe Trp 
Leu Val 
Gin Phe 
Asp Pro 
Ser Ala 



110 

Ala Pro Pro 
125 

Gly Pro Gin 
140 

Gly Pro Gin 
155 

Ser Trp Ala 
170 

Gly Ser Pro 
185 

Glu Trp Gly 
200 

Phe Trp Lys 
215 



Ala Pro 
Thr Val 
Pro Trp 
Glu Pro 
Gin Ala 
Leu Gin 
Arg Glu 



Ser Ala 
Leu Glu 
Glu Gly 
Glu Gin 
Arg Arg 
Pro Arg 
Gly Arg 



115 

Pro Asn 
130 

Val Pro 
145 

Arg Pro 
160 

Arg Pro 
175 

Gin Arg 
190 

Val Thr 
205 

Thr Ser 
220 



Ser Ala 
Ala Arg 
Pro Ala 
Glu Ala 
Pro Gly 
Leu Glu 
Val Gly 



120 
Pro Ser 

135 
Ser Thr 

150 
Thr Gly 

165 
Ser Val 

180 
Ser Pro 

195 
Gin He 

210 

Phe 



<210> 52 
<211> 1480 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506904CB1 



<400> 52 

aatctgcaaa gttcctgtga gcgctgtcat 
ctggaggctg gagtttccag gatgtcaaaa 
agctgggata cctgtgtcac tcctgctgtc 
cctccccagg agtagggagg aaccaggtgg 
caggccttgg acgctgcagc acttctatct 
cctcctcctc ctgctgtggc tgagccttgg 
gaacctttgc tgtcttggga cggatcacca 
attctgccat gtggcaccag actgccaccc 
ctcacataca ggcccgattc cacctacagc 
accctttgcc tgcacttcag gccaaagccg 
tgacaatcct gagatcttgc atcttgtcta 
atggtgctgc agatggtgct gaggatggag 
gactggatgc agagcatggt gagctccctg 
ctgctccttg cctttgtgcc ctccaggccc 
ccgtagctag ggcagagctc cactgcaatg 
gtctgtggac actgaaattt gaatctcata 
tgttcttctc ctaccatcta aaaatgtaga 
acaggcagtg gccagatttg gcccatagac 
cctccatttc tttccttctg tgtccttgtt 
tctttccctg tttggtgcca tgaagccaat 
gcaggcttta tttggtggcc atggaattga 
ttctgctcgt gagacccagg aagtcacaga 
attaaaagca aggggaggag tggccaggtg 
tgggaggcca aaatgagagg attgctgaga 
agtgatacac ccccatctct acaaaaataa 



tttgtcactc tggtttttca gattcttccc 60 
ttacctctgc ttgggtgagc tatttcaagc 120 
tgccagtgac tgcccaggtg tctgctggtt 180 
gctggctggg atgggtggat atttaaagac 240 
ctgcttgatg cctgctgcca cgtggctggt 300 
ggtgaagaca ggcagctgct cccaacccca 360 
ctgcaagagg ggaagttgct actgtgatga 420 
agaccacagt gtcctctgca accctggtaa 480 
aaagctggat gcgatggctg gcagaggcaa 540 
ggatgtggcc tagatggttc ctaaggtccc 600 
tttcaggtca aagcttctca gatgaccaag 660 
aacccaccaa gccccgctag gagccaccta 720 
caggttctct gagaaggggt ggatggcagc 780 
caaagtcagg gaaccaaaag aagaaagggg 840 
attgttttag gggtaggagc caggattgcc 900 
tacttttgtg acaaaacatt cttcctcttt 960 
aaacattctt agcctatgag ttgcacaaaa 1020 
catagtttgc tgacttctgc cctaaatcat 1080 
actgacaaag ccactttccc taaaatgggg 1140 
atgcaaaacc gaaagtgagc ctcaagcagt 1200 
gaagtgagag cttggctcac aaatcaactt 1260 
tacagggcat ctttagtgaa ggggctgagc 1320 
caatggctca ctcccataaa cccagaactt 1380 
ccaggagttc gagaccatcc tggtcaacat 1440 
aaaaaaaaaa 1480 



<210> 53 

<211> 1168 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 7506909CB1 



<400> 53 

aatctgcaaa gttcctgtga gcgctgtcat 
ctggaggctg gagtttccag gatgtcaaaa 
agctgggata cctgtgtcac tcctgctgtc 
cctccccagg agtagggagg aaccaggtgg 
caggccttgg acgctgcagc acttctatct 
cctcctcctc ctgctgtggc tgagccttgg 
ggtgctgcag atggtgctga ggatggagaa 
ctggatgcag agcatggtga gctccctgca 
gctccttgcc tttgtgccct ccaggcccca 
gtagctaggg cagagctcca ctgcaatgat 
ctgtggacac tgaaatttga atctcatata 
ttcttctcct accatctaaa aatgtagaaa 
aggcagtggc cagatttggc ccatagacca 
tccatttctt tccttctgtg tccttgttac 
tttccctgtt tggtgccatg aagccaatat 
aggctttatt tggtggccat ggaattgaga 
ctgctcgtga gacccaggaa gtcacagata 
taaaagcaag gggaggagtg gccaggtgca 
ggaggccaaa atgagaggat tgctgagacc 
tgatacaccc ccatctctac aaaaataa 



tttgtcactc tggtttttca gattcttccc 60 
ttacctctgc ttgggtgagc tatttcaagc 120 
tgccagtgac tgcccaggtg tctgctggtt 180 
gctggctggg atgggtggat atttaaagac 240 
ctgcttgatg cctgctgcca cgtggctggt 300 
ggtgaagaca gcttctcaga tgaccaagat 360 
cccaccaagc cccgctagga gccacctaga 420 
ggttctctga gaaggggtgg atggcagcct 480 
aagtcaggga accaaaagaa gaaaggggcc 540 
tgttttaggg gtaggagcca ggattgccgt 600 
cttttgtgac aaaacattct tcctcttttg 660 
acattcttag cctatgagtt gcacaaaaac 720 
tagtttgctg acttctgccc taaatcatcc 780 
tgacaaagcc actttcccta aaatggggtc 840 
gcaaaaccga aagtgagcct caagcagtgc 900 
agtgagagct tggctcacaa atcaactttt 960 
cagggcatct ttagtgaagg ggctgagcat 1020 
atggctcact cccataaacc cagaactttg 1080 
aggagttcga gaccatcctg gtcaacatag 1140 

1168 



<210> 54 
<211> 2229 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7507096CB1 



<400> 54 

agactctgcg cacgcgcagt tccccctccc 
catgcgcggg gagggcggag ctgggcgttg 
gcggggcgtg gttccggggc ggggcgcggc 
ggcgacactc agcttcgtct tcctgctgct 
ctccggccag gagttctggc ccggacaatc 
ccgcagacgg tatcttctgt atgacgtcaa 
tgtctatatc cgaatcgcct ctctcctgaa 
tgtcctgcct ccatggggcc gcctctatca 
gattccctgg tctgagtttt ttgatcttcc 
gtatgagcag ttcatcgcag ggctcagcct 
catcagcccg gtccgtgatg ttagacagag 
ggaaagaata ctgggatacc cgtcgcagca 
gagacgagtt caggagcaga catctcaact 
aggaggactg gatgaagatg aaggtcaagc 
gagtccacct gagaagaaaa gatttcatct 
aaggggccgt gaggaagatc cgcagcctca 
tggccacaga tgccgtcaga aaggaatatg 
tgaggtttga acccacgtgg gaggagctgg 
ttgaccagtg gatctgcgca cacgccaggt 
cttttcggat tcatgaggaa agagaaatcc 



gcccctgctt gccggtgatg gcgcataacg 60 
ccgtggctac tgggaacgca tttcacgggg 120 
cgccggaagt gcgtggccgc ccggggccat 180 
gggggcagtg tcctggcctc cggcttctgc 240 
ggcggccgat attctgtcgg gggcggcttc 300 
ccccccggaa ggcttcaacc tgcgcaggga 360 
gactctgctg aagacggagg agtgggtgct 420 
ctggcagagt cctgacatcc accaggtccg 480 
aagtctcaat aaaaacatcc ccgtcatcga 540 
ccatcgtggc gcccctgctg ctgagaaaca 600 
ccgagaacct acttcacgac cactatggag 660 
tggtgtttgc caggcacctg cgggaggtgg 720 
ccacggacga cgcagacagg atccccttcc 780 
tgggctccgc gctagggggc ccctacctgg 840 
ggggtcacag acaggatgta cccagtctgg 900 
tgaagaccca ccggctggac aaggtgtttg 960 
aagagctaaa aaagctgtta cccgagatgg 1020 
agctctacaa ggacggaggc gttgcgatta 1080 
tttttattgg cacctcagtc tcaacatttt 1140 
tggggttgga ccccaagacg acgtacaaca 1200 
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ggttctgcgg agaccaagag aaggcgtgtg 

gaggaggatc ctccagggcc gctccccgga 

tcgccgtgga gtcaccgtct actgccagcc 

agggtcccag gcccagaaga ggccccacgc 

ccagccgcca tggctgatga agaggctccg 

cagcgtctgt gaacccacag ctcggttgcc 

gagtgtttgt caggctcccg ctctggcctt 

agtgcattga ggatagatgc aggcgggtga 

catttccctg ggagctggtg caaggagaag 

gggcctcatg aatctctccg ttgccctccg 

ctgggatgcg tgcaaggcag acctggtgct 

ccccgtggag ggatgacagt tcaggcccta 

ttcagtgacg tctctggggc actcagtgga 

tcacaccagg cccggctggc cacccaccct 

gactgccccg ccccaccctc accccgactg 

cctcgccccg gctggcaggt gcacatgggg 

tagaaatgag gaaggacagt tacgctaact 
tcagggagc 



agcaacccac ccactggaag atcacctact 1260 
cccgacaggc gcgggtggat gcaggttctg 1320 
gggagctggg cggacaggac cgtccctcgc 1380 
ctctagagct gggctccgtc ctcggcgttg 1440 
ctgctctcgg gggtggcggt tgttttcagg 1500 
agcagtgccc gcgtggtgac ccagaagcag 1560 
tccagccacc tttcatgtct tcatatttta 1620 
gctgccctcc gtcaggtgga cccgggctga 1680 
cgtcatttta aatgtctgca gagcgaccag 1740 
cgcagcagga ggctgcctgt gtgtttcctc 1800 
gcaaaggaaa gggcctgagg cctcagggag 1860 
ctgctggcac gtcagagcac tgggaagttt 1920 
ttgtctgtag gaaacttgca gctctgctcc 1980 
cgcccccact ggccacccct ccctcgcccc 2040 
ccccgccctc gcccggctgg ccgtccctgc 2100 
cctccaggtc tgccattcgc tattgagaac 2160 
ccaaaaggct gtctaggatg agctgcttta 2220 

2229 



<210> 55 
<211> 2374 
<212> UNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7507098CB1 



<400> 55 

agactctgcg cacgcgcagt tccccctccc 
catgcgcggg gagggcggag ctgggcgttg 
gcggggcgtg gttccggggc ggggcgcggc 
ggcgacactc agcttcgtct tcctgctgct 
ctccggccag gagttctggc ccggacaatc 
ccgcagacgg tatcttctgt atgacgtcaa 
tgtctatatc cgaatcgcct ctctcctgaa 
tgtcctgcct ccatggggcc gcctctatca 
gattccctgg tctgagtttt ttgatcttcc 
gtatgagcag ttcatcgcag aatctggtgg 
aagttacgca gaggggtgga aagaagggac 
tattgatcag ctcctgtact cccaggacaa 
gtggcgcccc tgctgctgag aaacacatca 
aacctacttc acgaccacta tggagggaaa 
tttgccaggc acctgcggga ggtgggagac 
gacgacgcag acaggatccc cttccaggag 
tccgcgctag ggggccccta cctgggagtc 
cacagacagg atgtacccag tctggaaggg 
acccaccggc tggacaaggt gtttgtggcc 
ctaaaaaagc tgttacccga gatggtgagg 
tacaaggacg gaggcgttgc gattattgac 
attggcacct cagtctcaac attttctttt 
ttggacccca agacgacgta caacaggttc 
cccacccact ggaagatcac ctactgagga 
caggcgcggg tggatgcagg ttctgtcgcc 
ctgggcggac aggaccgtcc ctcgcagggt 
gagctgggct ccgtcctcgg cgttgccagc 



gcccctgctt gccggtgatg gcgcataacg 60 
ccgtggctac tgggaacgca tttcacgggg 120 
cgccggaagt gcgtggccgc ccggggccat 180 
gggggcagtg tcctggcctc cggcttctgc 240 
ggcggccgat attctgtcgg gggcggcttc 300 
ccccccggaa ggcttcaacc tgcgcaggga 360 
gactctgctg aagacggagg agtgggtgct 420 
ctggcagagt cctgacatcc accaggtccg 480 
aagtctcaat aaaaacatcc ccgtcatcga 540 
gtcctttatt gaccaggttt acgtcctgca 600 
ctgggaagag aaggtggacg agcggccgtg 660 
gcacgagtac tacagggctc agcctccatc 720 
gcccggtccg tgatgttaga cagagccgag 780 
gaatactggg atacccgtcg cagcatggtg 840 
gagttcagga gcagacatct caactccacg 900 
gactggatga agatgaaggt caagctgggc 960 
cacctgagaa gaaaagattt catctggggt 1020 
gccgtgagga agatccgcag cctcatgaag 1080 
acagatgccg tcagaaagga atatgaagag 1140 
tttgaaccca cgtgggagga gctggagctc 1200 
cagtggatct gcgcacacgc caggtttttt 1260 
cggattcatg aggaaagaga aatcctgggg 1320 
tgcggagacc aagagaaggc gtgtgagcaa 1380 
ggatcctcca gggccgctcc ccggacccga 1440 
gtggagtcac cgtctactgc cagccgggag 1500 
cccaggccca gaagaggccc cacgcctcta 1560 
cgccatggct gatgaagagg ctccgctgct 1620 
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ctcgggggtg gcggttgttt tcaggcagcg tctgtgaacc cacagctcgg ttgccagcag 1680 

tgcccgcgtg gtgacccaga agcaggagtg tttgtcaggc tcccgctctg gcctttccag 1740 

ccacctttca tgtcttcata ttttaagtgc attgaggata gatgcaggcg ggtgagctgc 1800 

cctccgtcag gtggacccgg gctgacattt ccctgggagc tggtgcaagg agaagcgtca 1860 

ttttaaatgt ctgcagagcg accaggggcc tcatgaatct ctccgttgcc ctccgcgcag 1920 

caggaggctg cctgtgtgtt tcctcctggg atgcgtgcaa ggcagacctg gtgctgcaaa 1980 

ggaaagggcc tgaggcctca gggagccccg tggagggatg acagttcagg ccctactgct 2040 

ggcacgtcag agcactggga agtttttcag tgacgtctct ggggcactca gtggattgtc 2100 

tgtaggaaac ttgcagctct gctcctcaca ccaggcccgg ctggccaccc accctcgccc 2160 

ccactggcca cccctccctc gccccgactg ccccgcccca ccctcacccc gactgccccg 2220 

ccctcgcccg gctggccgtc cctgccctcg ccccggctgg caggtgcaca tggggcctcc 2280 

aggtctgcca ttcgctattg agaactagaa atgaggaagg acagttacgc taactccaaa 2340 

aggctgtcta ggatgagctg ctttatcagg gage 2374 

<210> 56 

<211> 2151 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7507099CB1 



<400> 56 

agactctgeg cacgcgcagt tccccctccc 
catgegeggg gagggeggag ctgggcgttg 
gcggggcgtg gttcegggge ggggegegge- 
ggcgacactc agettegtet tcctgctgct 
ctccggccag gagttctggc ccggacaatc 
ccgcagacgg tatcttctgt atgacgtcaa 
tgtctatatc cgaatcgcct ctctcctgaa 
tgtcctgcct ccatggggcc gcctctatca 
gattccctgg tctgagtttt ttgatcttcc 
gtatgagcag ttcatcgcag aatctggtgg 
aagttacgca gaggggtgga aagaagggac 
tattgatcaa ctccacggac gaegcagaca 
tgaaggtcaa gctgggctcc gcgctagggg 
aagatttcat ctggggtcac agacaggatg 
tccgcagcct catgaagacc caccggctgg 
gaaaggaata tgaagagcta aaaaagctgt 
gggaggagct ggagctctac aaggaeggag 
cacacgccag gttttttatt ggcacctcag 
aaagagaaat cctggggttg gaccccaaga 
agaaggegtg tgagcaaccc acccactgga 
ccgctccccg gacccgacag gcgcgggtgg 
ctactgccag ccgggagctg ggeggacagg 
gaggccccac gectctagag ctgggctccg 
gaagaggctc cgctgctctc gggggtggcg 
agctcggttg ccagcagtgc ccgcgtggtg 
cgctctggcc tttccagcca cctttcatgt 
geaggegggt gagctgccct ccgtcaggtg 
tgcaaggaga agegtcattt taaatgtctg 
cgttgccctc cgcgcagcag gaggctgect 
agacctggtg ctgcaaagga aagggcctga 
gttcaggccc tactgetgge aegtcagage 
gcactcagtg gattgtctgt aggaaacttg 



gcccctgctt gccggtgatg gcgcataacg 60 
ccgtggctac tgggaacgea tttcacgggg 120 
cgccggaagt gcgtggccgc ccggggccat 180 
gggggcagtg tcctggcctc cggcttctgc 240 
ggeggecgat attctgtegg gggeggctte 300 
ccccccggaa ggcttcaacc tgcgcaggga 360 
gaetctgetg aagaeggagg agtgggtgct 420 
ctggcagagt cctgacatcc accaggtccg 480 
aagtctcaat aaaaacatcc ccgtcatcga 540 
gecctttatt gaccaggttt acgtcctgca 600 
ctgggaagag aaggtggacg ageggcegtg 660 
ggatcccctt ccaggaggac tggatgaaga 720 
gcccctacct gggagtccac ctgagaagaa 780 
tacccagtct ggaaggggee gtgaggaaga 840 
acaaggtgtt tgtggccaca gatgeegtea 900 
tacccgagat ggtgaggttt gaacccacgt 960 
gcgttgcgat tattgaccag tggatctgcg 1020 
tctcaacatt ttcttttegg attcatgagg 1080 
egaegtacaa caggttctgc ggagaccaag 1140 
agatcaccta ctgaggagga tcctccaggg 1200 
atgcaggttc tgtcgccgtg gagtcaccgt 1260 
accgtccctc gcagggtccc aggeccagaa 1320 
tcctcggcgt tgccagccgc catggctgat 1380 
gttgttttca ggcagegtet gtgaacccac 1440 
acccagaagc aggagtgttt gtcaggctcc 1500 
cttcatattt taagtgcatt gaggatagat 1560 
gacccgggct gacatttccc tgggagctgg 1620 
cagagcgacc aggggectea tgaatctctc 1680 
gtgtgtttcc tcctgggatg cgtgcaaggc 1740 
ggcctcaggg agccccgtgg agggatgaca 1800 
actgggaagt ttttcagtga cgtctctggg 1860 
cagctctgct cctcacacca ggcccggctg 1920 
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gccacccacc ctcgccccca ctggccaccc 
tcaccccgac tgccccgccc tcgcccggct 
gtgcacatgg ggcctccagg tctgccattc 
gttacgctaa ctccaaaagg ctgtctagga 



ctccctcgcc ccgactgccc cgccccaccc 1980 
ggccgtccct gccctcgccc cggctggcag 2040 
gctattgaga actagaaatg aggaaggaca 2100 
tgagctgctt tatcagggag c 2151 



<210> 57 

<211> 3435 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7501399CB1 



<400> 57 

ggcgagcccg agccgccgcc ggccccgcgg 
tgtgcctgct gggctgccac gtctggaagg 
ccggagccca agaggtgacg ttaaaggtgc 
tagcagatgc gctcatcgag atcttcacca 
ggactgatgg cgtcgccttt atcaagttcc 
ccgcctcgaa gcatgcctac gtgccaaact 
tattttcctc tctgagcctt ggcctgcttc 
aagatgtcgt ccaaatagta tcaggattcc 
tccagagaag ggctctgagg ttgcctgaga 
tcacggccgc cagctcccct tcggaggtgg 
gaaatggaac aggaaacagc accaggeatg 
acttgctgag cagtaatgga acgccggtgc 
ccctggccac gcagagcagc ctgaggcaca 
agaagctggg aacgtggctg aagagcggtc 
tgacgtggac atacattgcc ccccagttgg 
tcccaggagg aagtgcttga aacctcgtca 
actggagagt tccaaaagag accagtccac 
acgccgagcg tcagaattcc ctggcccgct 
ccccggcacg aaggaactga tgagtggagt 
aggggacctg cacaccccca tgctcaagct 
ccgggaggag ctcctctctt gcaaggaaga 
cactccaagt gggacgctgg ggaaagacta 
ggcaagaaaa tctatggaaa gagaaggcta 
tagttacaac accgtgctct cacagccttt 
ctccacggga agcaaactca ccattcagga 
gaaagaacag ctgctggacc gcagacccac 
cctcgagaga cctacgtcct tcccacggcc 
ccaggtcaat gacagcgttt acaggaaagt 
tatgaaactc cccggggacc actcctatgt 
gcagcttgag atagaaagac tacaggctga 
acacccgtcc tcacagatcc agccccagcc 
cctgcaggat gcgggcaccc gggagtggag 
ctccatccca gcttccctga acgacgcggc 
cctgactgaa aaggccctga tggagcttgg 
gtggttcgtc tccttggatg gcaggtccaa 
ccaaagagct ggaaggaacg gaagtaatga 
tgaaccaaaa tcagcccgga agggaagggg 
gcccgtccaa gagcaccagc agaaagagcc 
gctcgtgtac ctggatgacg tggaacagag 
ccccgaggac agtgccctgc gatgcttgtt 
gctgcccagc ctgcaggagg agacgaccag 



cggagatgag caggtccgcg acgctgctgc 60 
cggtgaccaa gacgctgcgg gagcccggcg 120 
acatcagcga cgccagcacc caccagcccg 180 
accaggcctc catagcctct ggcacctcgg 240 
agtataagct gggcagtcag ttgattgtca 300 
ctgccccatg gaagccaatc cggttacctg 360 
cagaacgcte tgccactcta atggtatatg 420 
aaggtgcccg gccacagcct cgcgttcatt 480 
acaccagcta cagtgacctg accgcgtttc 540 
acagttttcc ttatttgcga ggattagacg 600 
acctgacccc agtcacagcc gtcagcgtcc 660 
tggtggatgg tcccatctat gtcactgtgc 720 
atgcctatgt cgcggcgtgg cggtttgacc 780 
tgggtcttgt gcaccaggaa ggcagccagc 840 
ggtactgggt ggccgccatg tcccctccca 900 
gcaccacaga aaactgcagc tccctgcagg 960 
gtccatgtca cacattaact tgctgttttc 1020 
gtccgtcacc agccacggcc gccccgaggc 1080 
ccatttggaa atgatgtctc cgggcggcga 1140 
ctcctacagc acctcccagg aatttagctc 1200 
ggataaaagc cagatctcct ttgataacct 1260 
ccataagtca gtggaggttt ttcccttaaa 1320 
cgagtcctcg ggcaatgatg actacagggg 1380 
atttgaaaag caggacagag aaggtccagc 1440 
acatctgtac cccgcgcctt catcacctga 1500 
tgaatgtatg atgtcgcgat cagtagatca 1560 
cggecagtta atctgctgca gttctgtcga 1620 
actgcctgcc ttggtcatcc cggctcatta 1680 
cagccagccc ct eg teg tec eggctgatca 1740 
gctgtccaat ecccatgecg ggatcttccc 1800 
cctgtettec caggccatct ctcagcagca 1860 
ccctcagaac gcatccatgt eggagtctet 1920 
tttggctcag atgaacagtg aggtgeaget 1980 
gggtgggaag ccgcttccgc acccccgggc 2040 
egcteaegtt agacattcat acattgatct 2100 
tgccagtttg gactctggcg tagatatgaa 2160 
agatgetttg tctctgcagc agaactacce 2220 
tcgagcccca gaeagcaegg ectacaegca 2280 
tggtagcgaa tgtgggacca eggtctgtae 2340 
ggaggggtcg agtcggagaa gtggtggeca 2400 
aeggactgeg gatgccccct cggagccagc 2460 
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agccagcccc caccagagaa gatctgccca 
ccaaggagaa gacaagaaaa gcccctggca 
taacattaaa tgagctatcg cagacccacc 
ctttctcatg gaagcgcgta cccgttcgtg 
acggacgtgg acgtttactg cattcctgtt 
ttattactcg gaataaagga tgactttggc 
agtggtgtcc agatgtcctt ccgaggactc 
ccgcacgtga tgtcgctatg ctctaacgat 
agatgaaact ttttgcatcg cttgagtcat 
tcgttgagaa cttaggttgg cacgtagcgt 
tatgcatcgc tgcttcgttg tctgattttg 
tggggtttgc ctttttttgt tgtcgcttaa 
ttgtgttcat ctgttccgct tctcagcggt 
gagtgtcctc tcttaacatc caagcctttt 
aagagtcctc caatcctggt cccattaact 
agtccctcgc tttttgttgt tgttggtttt 
ttatacagtt attaa 



cgaggaagag gaagacgatg atgatgatga 2520 

gaaacgggag gagaggcccc tgatggcgtt 2580 

tgactgtgga atataaaatt gccaaatatc 2640 

gaggaaacgg aacggcagcc cagccgtggg 2700 

tgccgtgtaa atgttagaaa ggaattaaag 2760 

ggatgtcgcc cctgcaagga ggtggctgaa 2820 

ggcgtatccg ccaccaggga cattaagaaa 2880 

cacctcagtt ctccctcgga ttctgggaac 2940 

ttttatcaca ataatcctac tgtgaagctg 3000 

ctcaaggtat gcgttctctc aaaggaaagc 3060 

cttagatttt gctttggtta ggttgcgttt 3120 

atgcaatttg gttgtaaaga tttgattcct 3180 

ccatctcagc gtctcccttc aggaaccgct 3240 

aatgaaatcg Cactgaaatc tgtatcagct 3300 

ccaagtgcct ttttgacagt gacaacagac 3360 

cttaacccct ttaatggaac tgcctggatt 3420 

3435 



<210> 58 
<211> 751 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<223> Incyte ID No: 7504768CB1 



<400> 58 

agagctgtgt caccatgtgg 
ggcagtctgc ggcggtgttc 
caggaacaaa agcgtgatct 
ccaggtattt caggtcagcc 
gaatcgattc ctcaggccag 
agagcctgcc gagctcacgg 
actggggacc acctgctacg 
cccaaagaaa cttcagtgtg 
tcaccctcag aaggtgacca 
cacctgctcg ggtgattctg 
gtcatggggc agtgaaccat 
gcattaccgg aagtggatca 
tccctattgt aaaaaacttg 



gtcccggttg tcttcctcac 
tggtgcaccc ccagtgggtc 
tgctgggtcg gcacagcctg 
acagcttccc acacccgctc 
gtgatgactc cagccacgac 
atgctgtgaa ggtcatggac 
cctcaggctg gggcagcatt 
tggacctcca tgttatttcc 
agttcatgct gtgtgctgga 
ggggcccact tgtctgtaat 
gtgccctgcc cgaaaggcct 
aggacaccat cgtggccaac 
gcccttggaa a 



cctgtccgtg acgtggattg 60 
ctcacagctg cccactgcat 120 
tttcatcctg aagacacagg 180 
tacgatatga gcctcctgaa 240 
ctcatgctgc tccgcctgtc 300 
ctgcccaccc aggagccagc 360 
gaaccagagg agttcttgac 420 
aatgacgtgt gtgcgcaagt 480 
cgctggacag ggggcaaaag 540 
ggtgtgcttc aaggtatcac 600 
tccctgtaca ccaaggtggt 660 
ccctgagcac ccctatacac 720 

751 



<210> 59 
<211> 1908 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500757CB1 



<400> 59 

ggccggctgg gcgtgcgctc gctccccgaa 
tgggagctgg gccgggtccg gggacagcgg 
aggccgctca gggcagggga cagctggcgc 
aggcggcggc gcccccggcc cgagagcgca 



gccggggctg ggccggagcc gggcgagggc 60 
gcgaggggca gctgccggag ccgggcagcc 120 
cggttctgcg gtctccgggg cccagatgtg 180 
cgatgggggc cccgctcgcc gtagcgctgg 240 
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gcgccctcca ctacctggca cttttcctgc aactcggcgg cgccacgcgg cccgccggcc 300 
acgcgccctg ggacaaccac gtctccggcc acgccctgtt cacagagaca ccccatgaca 360 
tgacagcacg gacgggcgag gacgtggaga tggcctgctc cttccgcggc agcggctccc 420 
cctcctactc gctggagatc cagtggtggt atgtacggag ccaccgggac tggaccgaca 480 
agcaggcgtg ggcctcgaac caggtggtca aggtggtggg cagcaacatc tcccacaagc 540 
tgcgcctgtc ccgggtgaag cccacggacg aaggcaccta cgagtgccgc gtcatcgact 600 
tcagcgacgg caaggcccgg caeca caagg teaaggecta ectgegggtg cagecagggg 660 
agaactccgt cctgcatctg cccgaagccc ctcccgccgc gcccgccccg ccgcccccca 720 
agecaggcaa ggagctgagg aagegctegg tggaccagga ggcctgcagc ctctagactg 780 
atgcccctgc ccccgcccat ccgcccccac gctgtacaga gtgeatgagg agccgccgga 840 
ccaccgggga ccgactgcct gcgtccagcc gtgccccatc cccgaggccg cctgtggcca 900 
ccatgtcggc cctctttcca ccaccccttg ctcagcatgt aagccccacc cacccctgcc 960 
ctttcagacc cctgcggtga cctggctcgg agaaggtggc cctgggcacc aaggggccaa 1020 
ccgccctgaa cactggggca gggaccatgc tggggecegg ggccaccccc ttcctgtcac 1080 
cagcttctgt ggagtccagt gttttgcttt gettgettgt cccccatcct gtcctgagcc 1140 
ggggcccccc agcctcgcct ccctcctcct accatccctc acttggacct gggggtgtgg 1200 
acagtgaccc ctccctgaat atggacttga atcttctgag cagaactagg gcctctcccc 1260 
tggtgaagac ccagggaacc caggagggee cttctggggc agtggctctg cagggtcact 1320 
catggaggee taggggaaca gegagatgee ccaccacctc ctggcgagtc cttcctgttc 1380 
agctccctgt gcgaccctcc agggatgeag gggatccagg attctctgcc ctgtcacacg 1440 
gcgagtcaga agggaggggc ctttccctcg gacccatggc cccaggcaga gttttgeace 1500 
agcaggaccc ctttgagggc cttcaaggct ctcccaggag tccccctctg ccggcccccc 1560 
aatgccccag ctccctcttg ggtcctgtgc caagtccgcc ccagggcctg gggctgttgg 1620 
gagecaaggg ccccctggta ctcagttccc tcacgattcc egatcaeggg cacacctgcc 1680 
ccctggttat ttgtaaatat ttctattgga cccaattctc cteggaattg gctggcacct 1740 
ctggctgcca cagctcagtg atgacgtggg ggaggtggga gaggecgagg gccttgccta 1800 
ggggtgggtt gecctgtata catgatccag tctgtgacta ccagccaacc tgaataaagc 1860 
ggttttaaaa aaaaaaaaaa aaaaaaaata tgcggcgcaa gcttattc 1908 

<210> 60 
<211> 1148 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1730616CB1 

<400> 60 

atccaatgac aagcggctcc ttgagaaagg gggggggggt actccttgaa agcattgtcc 60 
aatgaaaatt accaacggca gaeaeggaga cagcgccggg gcggagggta cgatggagaa 120 
tttcaeggea ctgtttgggg ctcaggctga cccaccaccg cccccaaccg cactcggctt 180 
eggaccagga aagcctccac ccccgccacc gcctcctgcg ggegggggae ccggcacggc 240 
cccgcctccc accgtggcca cggctcctcc tggegeggae aagtcaggag ctggctgtgg 300 
ccctttttac ctcatgaggg aactgecagg tagcacagag ctgacaggca gcacgaatct 360 
gatcacacac tacaacttgg aacaagecta taataaattc tgtgggaaga aggtgaagga 420 
gaagctaagt aacttcctgc ctgacctgcc agggatgatt gatctgectg gttcccatga 480 
taacagcagc ctccgctctc tcattgagaa gccccctatt ctcagtagct ctttcaatcc 540 
tatcacaggg accatgetgg ccggcttccg cctccacact ggcccgttgc eggagcagtg 600 
tegtctgatg catattcagc ctcccaagaa gaagaataag cacaagcaca aacagagecg 660 
tacccaggat cctgtccccc cagaaacacc atctgattca gatcacaaga agaagaaaaa 720 
gaaaaaagaa gaggatcctg aacggaaaag gaagaagaaa gagaagaaga aaaagaagaa 780 
tcgacatagt ccagaccacc caggtatggg cagctcccag gccagcagca gcagcagcct 840 
aegctaatag gaccactgga ctctttgcca ggatggcttt tcctgctgta ctgaacctgc 900 
tgataaaagc tgccttccag gctcttggac actgecttgg gagcatcctg cagctgggac 960 
agaggecage tcctgttggg ctcagttgag actaagtaaa ttaggagaga gagggacatg 1020 
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cttttgtagg ctcatcccag ttggttttct catggacatc tcttcctctc ccaggaagct 1080 

tacaattttc ttctctctct tttgtgcaat ttgtctgatt taggacttgt tctgtgtttt 1140 

ctttaaaa 1148 

<210> 61 

<211> 1776 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 190404CB1 

<400> 61 

agagagagag agagaactag tctcgagcta ttctcgagta tgtaatgatc aacagcatat 60 

ttgaagcaat tttacagata ctttcccatc ccccaagtcg tagggagcat gggtatgatg 120 

ctgtcgtcct cttggctttg ctggtgaact atagaaaata tgagtctgtg aatccttata 180 

ttgtgaagct gtctatcgtg gatgatgagg ccacactcaa tggaatggga cttgtaattg 240 

ctcaggcttt atctgagtac aacaggcagt ataaagacaa ggaagaagaa caccaaagtg 300 

gttttttctc tgctttaaca aatatggtgg gcagcatgtt catagcagat gcccatgaga 360 

aaatctcagt acaaaccaat gaggccattc ttctggcact ttatgaagct gttcatttaa 420 

atcgcaactt catcacagta ttagctcaga gccatccaga aatgggcttg gtgacgaccc 480 

ctgtcagtcc tgctcctaca accccagtca caccacttgg gaccacaccg ccttcctctg 540 

atgttataag ttctgtggaa ctcccactgg atgcagatgt acagaccagt aatctactga 600 

taaccttttt aaagtatagc tcaattgtca tgcaggacac caaagatgaa cacaggcttc 660 

acagtggcaa actctgtctg attatcctta catgtattgc agaggatcaa tatgccaatg 720 

catttttgca tgatgacaac atgaattttc gagtaaactt acatagaatg cctatgagac 780 

acaggaagaa ggcagcagac aagaatcttc cctgccgtcc tttagtatgt gcagtactgg 840 

acctgatggt agagtttatt gtaacacaca tgatgaagga gtttcctatg gatctctata 900 

tacgctgcat ccaggtagta cacaaactgc tctgctacca gaagaagtgt cgggtacgcc 960 

tgcattacac ctggcgggag ctctggtcag ccttgataaa tttgctgaag ttccttatgt 1020 

caaatgagac tgtacttttg gccaaacaca acatttttac attagccctt atgattgtga 1080 

acctatttaa tatgtttatc acatatggcg acacatttct gccaaccccc agcagctatg 1140 

atgaacttta ctatgagatt atccgcatgc accagagctt tgacaacctc tactccatgg 1200 

tcctgaggct ttctaccaat gcaggccagt ggaaggaagc agctagcaag gtgacccatg 1260 

cattggttaa tatcagagcc atcatcaacc actttaaccc caaaattgag tcctacgctg 1320 

ctgtgaatca catatcccaa ctgtcagagg agcaggtgct ggaggtggtg agagccaact 1380 

atgacacgct cacgctgaag ctgcaggatg gcctggacca gtatgagcgc tactcagagc 1440 

agcacaagga agctgccttc ttcaaagagc tggttcgatc cattagcacc aacgtccgga 1500 

gaaacctggc cttccacaca ctcagccaag aagtcctgct caaggagttc tccactatct 1560 

cctgaggcca cgcctacctg agcagcctct gactgccctt acccatgagg atcatgggct 1620 

ggagggggga gcgaggggag agggggctgc ccccgaggtt ggagagaaac acagatacca 1680 

agttctcttg gtgaagaccg cacttcaatg gagcttgggc agcagggcag gaggtcaagg 1740 

ggatccacta gttctaaacg ccccacccgc gtgcca 1776 

<210> 62 

<211> 1765 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500679CB1 

<400> 62 

ctctgagcag aactgacagc atgaaggtac ggggcccagg gtcgggggac tcatagcatg 60 
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ggggaactga gcccactcca gaggcccctg 
gttctcttga aactcgcact cctggccctg 
tgtgcccccc aggtctccgg gatccgagga 
agtaagtttc ttccgcggct ggaatgacta 
gtactggctg gggctgcaga acatgcacct 
agtggacttg gaggactttg agaacaacac 
ctccccgaac gcggtcagcg cagaggagga 
ggatggcggg gcaggtgact ccctgtccta 
ccgggaccag gacctctttg tgcagaactg 
ccgcagctgc cactttgcca acctcaatgg 
tgccaatggc atcaactggg cccagtggaa 
gatgaaaatc cgccgggcct gaagggctgg 
cccatggtct ccatgagtgc tccctctgct 
agcaccaact ccttacaagg gggccttgtg 
acccagggca tccattccta agccagaccc 
gcagttcccc aggcctcttc cgagaggcac 
atgagaatga gttgcctcca ccctgtccca 
ctgcaggcct ttgtggtcca tcttgtcctg 
ccctagtgtg ctaagctgag cagtgcagtg 
cagcccagca ctcctctggg ctgctctcaa 
caacttactc ccctgtggtg agtaccgtgg 
cttcccctga tggcccatcc cagcccctga 
accaccctga aacagccaca gccaaggtag 
tgggtgttga gaggtttagc cacatgaggg 
tgggtaggga ctgaaggtct caataaacct 
acaaacatat ttgtttatcc cccaaatgta 
atggtgtcct accccgaact ccaaaaatta 
gtatttcaca ggcctcattt tgatggcaaa 
aaaaatataa tactgaaaaa aaaaa 



gccacagagg gcactatgaa ggcacaagga 120 
ccgctgctgc tgcttctctc cacgcccccg 180 
gatggttttc cagaagagat tcaatggctc 240 
caagctgggc ttcggccgtg ctgatggaga 300 
cctgacactg aagcagaagt atgagctgcg 360 
ggcctatgcc aagtacgctg acttctccat 420 
tggctacacc ctctttgtgg caggctttga 480 
ccacagtggc cagaagttct ctaccttcga 540 
cgcagctctc tcctcaggag ccttctggtt 600 
cttctaccta ggtggctccc acctctctta 660 
gggcttctac tactccctca aacgcactga 720 
ccccctcagg cacctttcct cccctggaca 780 
gcccctgatg catgcttctg ctgattcccg 840 
gctctcagcc atgccacatc cctgtcacac 900 
ggctccccta cacctgaagt tacactgcca 960 
atggttctag cctggacctg gctgggctcc 1020 
acagctgaca gccaggagcc actctcccag 1080 
cttcctcact gtggacccct gtctgggcca 1140 
tgaacagggc ccatggtgta ttctaggcca 1200 
accatgtccc atcttcagca tccctcccac 1260 
aaccccagcc cacctcacta tcatactcag 1320 
agctctatgc caagaacaca gctaccgcac 1380 
gcatgcatat gaggtcttcc ccataccctc 1440 
agcagaggac aatctctgca gggctgggag 1500 
tcagaacctg aatgaactgg cttcatacac 1560 
ggcacctggc tcctccttgc tcccctgctg 1620 
cacctggagt caggtgcaga agggaacctt 1680 
aagacagtgt aataataaca taataataat 1740 

1765 



<210> 63 
<211> 1695 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7500687CB1 



<400> 63 

gttgctccgg cggcgctcgg 
tggcgctgcc tccaggccca 
tgagctcagg cccctcccgg 
tcaacaacgt gcgcctgcca 
acctcactcc agattccaga 
gccggccaga gcttctggac 
gcctcccagt gctgggctat 
tctgagccga gggcgagaca 
aaggcatcct ctacctagct 
cttgccctcc tggctggggt 
aacccgcagg ggcacaggta 
tgactcaggg tgggccctgc 
ggggtgttca gatgtcatcc 
agtactccac agcaccttgt 
ggccctgcat ggattttcct 
taggaccctg ctagctgtgc 



ggagggagcc agcagcctag 
gccgccctcc ggcacacact 
cacccatctc tgatatcaag 
cgggagaaca tgtccctccc 
gcagtgaaac cagcagaccg 
ccggagcccg gcggcctcct 
atctatcgag tgtccagcgt 
ggagtattct cttggcctct 
aggtcaccaa cgtgaagaag 
gccctccatg tcatgcacgt 
tctttggcaa ggctaccagt 
atgtgatgac tgggcccttc 
agcatccaag tgtggcatgg 
acagtaggca tgggggcgtg 
ccttcctatg ctatgtagcc 
agaacccaat tgccctttgc 



ggcctaggcc cgggccacca 60 
gctgctcctg ccagcccttc 120 
tgactccaac aacctaaaac 180 
gtccaacctt cagctcaatg 240 
gcagatggct cagaacaaca 300 
caccagccaa ggtttcatcc 360 
gagcagtgat gagatctggc 420 
ggacaccctc ccattcctcc 480 
ttatgccact gccacttttg 540 
gatgcatttc actgggctgt 600 
tggacgtaag cccctcatgc 660 
cagagggagc tctttggcca 720 
cctgctgtat accccacccc 780 
cctgtgtggg ggacagggag 840 
ttgttccctc aggtaaaatt 900 
acagaaacca acccctgacc 960 
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cagcggtacc ggccaagcac aaacgtcctt 

ttctcttctg tccccacctc ctcttgggaa 

acttcactgg gcactagact tttctattgg 

tagggaggaa gaggcaggcg atgagtctag 

cattttccta cggcgttagc actttaagca 

cccagagccc tctttgtggc ttccccacgt 

tgagctctcg gggttgatgg tttttctctc 

cctgaccaac ccatggttgc ctcatcagca 

acaggcacat aattcaacag tgtggaagct 

cataccccaa agtgacctaa gaacacttta 

tatagtacag aatatatttt tcccttgttg 

ctgcctaggg egg t get gag cacacagcaa 
aaaaaaaaaa aaaaa 



tttgetgeae acgtctctgc ccttcacttc 1020 
ttctaggtta cacgttggac cttctctact 1080 
cctgtgccat cgcccagtat tagcacaagt 1140 
tagcacccag gaeggcttgt agetatgeat 1200 
catcccctag gggagggggt gagtgagggg 1260 
ttggccttct gggattcact gtgagtgtcc 1320 
agcatgtctc ctccaccacg ggaccccagc 1380 
ggaaggtgcc cttcctggag gatggtcgee 1440 
ttaggggaac atggagaaag aaggagacca 1500 
aaaagcaaca tgtaaatgat tggaaattaa 1560 
agatcttctt ttgtaatgtt tttcatgtta 1620 
gtttaataaa cttgactgaa ttcatttaca 1680 

1695 



<210> 64 

<211> 2202 

<212> UNA 

<213> Homo sapiens 



<220> 

<221> mi sc_ feature 

<223> Incyte ID No: 7500688CB1 

<400> 64 

gttgetcegg cggcgctcgg ggagggagee 
tggcgctgcc tccaggccca gccgccctcc 
tgagctcagg ggggcctggc acccccagat 
aggccagcac ctcaagactg ctgagcgtgg 
ccttcactgt cactgcccat cgggcccagc 
gaagtggccg atcagccaac gcctctgtca 
cccaagtcgg cgccaagtac caggaagctc 
ccctggtgcg tgccaacccg ccggccaatg 
ctgtcaacac ctctgacttc ctggtgctgg 
acaeggtgea gctgcagctc cgcagcctgg 
acgtgggtgt caccagtgcg tcgcttccag 
tatcaagtga ctccaacaac ctaaaactca 
ccctcccgtc caaccttcag ctcaatgacc 
cagaccggca gatggctcag aacaacagcc 
gcctcctcac cagecaaggt ttcatccgcc 
ecagegtgag cagtgatgag atctggctct 
ggcctctgga caccctccca ttcctccaag 
gaagaagtta tgccactgcc acttttgett 
tgcacgtgat gcatttcact gggctgtaac 
taccagttgg acgtaagccc etcatgetga 
gcccttccag agggagctct ttggccaggg 
ggcatggcct gctgtatacc ccaccccagt 
gggcgtgcct gtgtggggga cagggagggc 
tgtagccttg ttccctcagg taaaatttag 
cctttgcaca gaaaccaacc cctgacccag 
gctgcacacg tctctgccct tcacttcttc 
taggttacac gttggacctt ctctactact 
gtgccatcgc ccagtattag cacaagttag 
cacccaggac ggcttgtagc tatgeatcat 
cccctagggg agggggtgag tgaggggece 
gccttctggg attcactgtg agtgtcctga 
atgtctcctc caccacggga ccccagccct 



ageagectag ggcctaggcc cgggccacca 60 
ggcacacact gctgctcctg ccagcccttc 120 
tggcctggta tctggatgga cagctgeagg 180 
gaggggaggc cttctctgga ggcaccagca 240 
atgagctcaa ctgctctctg caggacccca 300 
tccttaatgt gcaattcaag ccagagattg 360 
agggeccagg cctcctggtt gtcctgtttg 420 
tcacctggat cgaccaggat gggccagtga 480 
atgcgcagaa ctacccctgg ctcaccaacc 540 
cacacaacct ctcggtggtg gccaccaatg 600 
ccccaggccc ctcccggcac ccatctctga 660 
acaacgtgcg cctgccacgg gagaacatgt 720 
tcactccaga ttccagagca gtgaaaccag 780 
ggecagaget tctggacccg gagcccggcg 840 
tcccagtgct gggctatatc tatcgagtgt 900 
gagecgaggg cgagacagga gtattefcett 960 
gcatcctcta cctagctagg tcaccaacgt 1020 
gccctcctgg ctggggtgcc ctccatgtca 1080 
ccgcaggggc acaggtatct ttggcaaggc 1140 
ctcagggtgg gccctgcatg tgatgactgg 1200 
gtgttcagat gtcatccagc atccaagtgt 1260 
actccacagc accttgtaca gtaggcatgg 1320 
cctgcatgga ttttcctcct tectatgeta 1380 
gaccctgcta gctgtgcaga acccaattgc 1440 
cggtaccggc caagcacaaa cgtccttttt 1500 
tcttctgtcc ccacctcctc ttgggaattc 1560 
tcactgggca ctagactttt etattggect 1620 
ggaggaagag geaggegatg agtctagtag 1680 
tttcctaegg cgttagcact ttaagcacat 1740 
agagccctct ttgtggcttc cccacgtttg 1800 
getctegggg ttgatggttt ttctctcagc 1860 
gaccaaccca tggttgcctc atcagcagga 1920 
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aggtgccctt cctggaggat ggtcgccaca ggcacataat tcaacagtgt ggaagcttta 1980 
ggggaacatg gagaaagaag gagaccacat accccaaagt gacctaagaa cactttaaaa 2040 
agcaacatgt aaatgattgg aaattaatat agtacagaat atatttttcc cttgttgaga 2100 
tcttcttttg taatgttttt catgttactg cctagggcgg tgctgagcac acagcaagtt 2160 
taataaactt gactgaattc atttacaaaa aaaaaaaaaa aa 2202 

<210> 65 

<211> 779 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500697CB1 



<400> 65 

ctccctccct cgccttcctc 
ttgatttcag gcggacacca 
ccagaagcaa ggctaggagc 
ctgctcacgg cagtgccacc 
ccgtcggaaa caaggcagat 
cggccagttt caggtccagt 
gcaaggtccg cagcaccccg 
cacatgggat ggatgtccag 
caatagtggg ctgactccct 
agccaaggtt ggagctcaga 
ctgctcagct cagaagagcc 
cacacgtcct tccccataac 
gtatttctgg ggctgaggct 



cttcctagct cctctcctcc 
atagactcca cagcagctcc 
tgctgcagcc atgtcggccc 
tgccagctgt cagcaaggga 
ggagtcctgg tgggaacaga 
gagcatgaga atgcctatga 
atgtaacctt ctctgtggct 
tgctaccacc caagccccct 
ccagccccat gccggcccta 
ccgtgtctag gttggggctc 
ttctggagag gacagtcagc 
tatggaaatg gccctaattt 
cagcaacagc ccctcaggct 



agggccagac tgagcccagg 60 
aggagcccag acaccggcgg 120 
tcagcctcct cattctgggc 180 
gcctgcacac atgatcctga 240 
tggaaggtac tcttcgatgg 300 
gaatgtgccc gaggaggaag 360 
ccaaccccaa gactcccagg 420 
ccttctttgt gtggaatctg 480 
cccgcccttg aagtatagcc 540 
ggctgtggcc ctggggtctc 600 
tgagcacctc ccatcctgct 660 
ctgtgaaata aagacttttt 720 
tccagtgaaa aaaaaaaaa 779 



<210> 66 

<211> 3104 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> nvisc_f eature 

<223> Incyte ID No: 7500709CB1 



<400> 66 

aggacaaggt cgggagagga ggaggagggg 

gacggagagg agcgaggacg gaggggggac 

tccattgcga cagcgacaaa caggacaccg 

tccccgttgc cgagcaaccg aagcatggcg 

gcccccgccc gcccggctgg tcggaagtga 

ggtgagaggc cgcggcggca ggtccacctg 

agctcacgac cagatgcacc agcaggagtc 

cgaccagtga ccaggagtta aactttggga 

agaggccgaa atccacccct tgaaaaatga 

tccatcaaaa aatgaggata acgtgaaaag 

ccgagcggcg gcgttttttc tttcattgtt 

attcgtcatc ccgtgtccag accggccggc 

tgccgctgtt atctatgact ttctggctgt 

tgttcttttt ctttataaaa acaccaacag 

cgaaggcttt tcctctccct gcacctttgc 

gctctgggag agacctgtgg cccaagacgt 



agcgaggacg cagagaagga ggggaaagag 60 
gggaacgaag aggggcgagg acgttcagca 120 
ccaggttccc ggcggaggca gcgcccaaga 180 
gtggggcggg gtcggagcct gcgcatttcc 240 
cgccaggggg cggggccagc ggcgcggtcg 300 
ggcttgcgaa ggcacagatt ccccgtccac 360 
cacatcgagg acgtcctccg ggcactccca 420 
tgtgcccgtg atgttggacc acaaggactt 480 
agaaagaaaa tcgcaggaaa atctgggaaa 540 
cgcgcctcca cagtcccggc tctcccggtg 600 
tctctgcctt tttgtggtgt tcgtcgtctc 660 
gtcacagcga atgtggagga tagactacag 720 
ggatgatata aacggggaca ggatccaaga 780 
cagcaacaat ttcagccgat cctgtgtgga 840 
agctgctgtg tcgggggcca acggcagcac 900 
ggccctcgtg gagtgtgctg tgccccagcc 960 
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aagaggcagt gaggcacctt ctgcctgcat 
agtcaacttg ttcacagggg aaaccctgtg 
gtccatcctg agccctctgc tgcaggtgcc 
gctggttctc acccaggagc gggaggaggt 
gcaccagatt ggcctcagag gcagccttgg 
cgtcaccagg acaggtgccc actacatcct 
ctctgtgaag ggtctctacg agaaggtgac 
gcactgggag agcatgctca atgccaccac 
agtgcgctac ctgatgcatg tcccagggaa 
agaggccttc gtgctgctgg acgggcagga 
ccatgtcctg agaaaaccca tcttcggccg 
tgaaaacgga actggcaccg acagacagga 
gttccacccc accctgccgc gcgtgctgct 
cgcctttgac gccgtcctgt ttgagccaag 
cccggcagac tcagaggcac ccggcctggt 
tgtcccaagc agcagggtgg tccgcctggg 
cagggaccgg ttctcccggc tgcggtacca 
cctgtggaga gactccgcct gctgacacta 
tctctgcact gactccccca ctcctgaccc 
cttaccagtc ctccatgatc acacccaggg 
tctctcccgc ccagcatcct ccctgagtcc 
gggtcccgct cacaccaggc agccttcata 
gggtcatgca gcaccccatc cttacccggt 
agagcggcca tcgcgtagaa agaaccaggg 
tcctgtagaa gtccattccc cttttccctc 
ctcagggtac tgggcctcaa cgggaacctg 
agctacaggg ggatcctcta gcatgggggg 
gaggaagcct ggagcaaggg tctcccccag 
gcccgtggcc gctcacaggt gtccttagtg 
tgtgaatatc ccaaggaact ggctgtggaa 
gtagacactg gagctgctct gtccctgaag 
tgcctctccc cttccggtgc tcacacgccc 
ctgtgccttg ctgctcctga ggcccaaggg 
cccaggtcac agagcctgag cttcgtagcc 
agaaagcaga ataaacattt ttgcactgcc 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



cctggtgggc agacccagtt ctttcattgc 1020 
gaaccacagc agcagcttca gcgggaatgc 1080 
tgatgtggac ggcgatgggg ccccagacct 1140 
tagtggccac ctctactccg gcagcaccgg 1200 
tgtggacggg gaaagtggct tcctccttca 1260 
ctttccctgc gcaagctccc tctgcggctg 1320 
cgagagcggc ggcccgttca agagtgaccc 1380 
ccgcaggatg ctttcccaca gctctggagc 1440 
cgccggtgca gatgtgcttc ttgtgggctc 1500 
gctgacgcct cgctggacac ccaaggcagc 1560 
ctacaaacca gacaccttgg ctgtagccgt 1620 
gaccggggag gcccggcaca gcctgtacat 1680 
ggagctggcc aatgtctcta cccacattgt 1740 
ccgccacgcc gcctacatcc ttctgacagg 1800 
ctctgtgatc aagcacaagg tgcgggacct 1860 
tgagggtggg ccagacagtg accaagccat 1920 
gagtgaggcg tagaggcacg ccagccagag 1980 
aacgtcctgg gaagtgggcc cttccctggg 2040 
tggtgatggt cgccactggg cagcagcagc 2100 
acctgcatgg gtgaggggac accctgggcc 2160 
ccacacaggg cctcactctg caccccacca 2220 
gtggtctccc tggccacctt gggcagagct 2280 
gccctctcct tgccagcttc tccccaggcc 2340 
tgtccccggg acaggccgtc ccccacccca 2400 
ctgtgctctg tcccccaagg agtcatggaa 2460 
agacagctcc agcttcgcag cccttcccgg 2520 
tgtgacttgg ttcctttgac caggtcctgt 2580 
caggatgggt ggggcctgct ctggagctga 2640 
gtgttgcagc tgtctactgg ctgcatgtgc 2700 
tgcgtgtttg ggtcagtctg tgccctctca 2760 
aggccccgtg ccccaggcat ggcaagcgcc 2820 
acgccgtgcc acccgatgca ggactcacct 2880 
cagccatggt get ctg tact gctcgggccg 2940 
aaagcagect gatgacccac ccaccaagga 3000 
tgaaaaaccc cggtggtcag gcgtgagcct 3060 
aaaatggtgc ggtc 3104 



<210> 67 

<211> 3233 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_£eature 

<223> Incyte ID No: 7500711CB1 

<400> 67 

aggacaaggt egggagagga ggaggagggg 
gaeggagagg agegaggacg gaggggggac 
tecattgega cagegacaaa caggacaccg 
tccccgttgc cgagcaaccg aagcatggcg 
gcccccgccc gcccggctgg tcggaagtga 
ggtgagaggc cgcggcggca ggtccacctg 
agctcacgac cagatgcacc agcaggagtc 
cgaccagtga ccaggagtta aactttggga 
agaggecgaa atccacccct tgaaaaatga 



agegaggacg cagagaagga ggggaaagag 60 
gggaacgaag aggggegagg aegttcagea 120 
ccaggttccc ggeggaggea gcgcccaaga 1B0 
gtggggcggg gteggagect gcgcatttcc 240 
cgccaggggg cggggccagc ggegeggteg 300 
ggcttgcgaa ggcacagatt ccccgtccac 360 
cacatcgagg acgtcctccg ggcactccca 420 
tgtgcccgtg atgttggacc acaaggactt 480 
agaaagaaaa tegcaggaaa atctgggaaa 540 
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tccatcaaaa aatgaggata acgtgaaaag 
ccgagcggcg gcgttttttc tttcattgtt 
attcgtcatc ccgtgtccag accggccggc 
tgccgctgtt atctatgact ttctggctgt 
tgttcttttt ctttataaaa acaccaacag 
cgaagctgct gtgtcggggg ccaacggcag 
cgtggccctc gtggagtgtg ctgtgcccca 
catcctggtg ggcagaccca gttctttcat 
gtggaaccac agcagcagct tcagcgggaa 
gcctgatgtg gacggcgatg gggccccaga 
ggttagtggc cacctctact ccggcagcac 
tggtgtggac ggggaaagtg gcttcctcct 
cctctttccc tgcgcaagct ccctctgcgg 
gaccgggagc ggcggcccgt tcaagagtga 
cacccgcagg atgctttccc acagctctgg 
gaacgccggt gcagatgtgc ttcttgtggg 
ggagctgacg cctcgctgga cacccaaggc 
ccgctacaaa ccagacacct tggctgtagc 
gatcctgttt ctggaccttg gcactggagc 
ccctgggggt ccactgtccg ccagcctgcc 
ctggggcctc cacgagctgg ggagcaccag 
cctgtacatg ttccacccca ccctgccgcg 
ccacattgtc gcctttgacg ccgtcctgtt 
tctgacaggc ccggcagact cagaggcacc 
gcgggacctt gtcccaagca gcagggtggt 
ccaagccatc agggaccggt tctcccggct 
cagccagagc ctgtggagag actccgcctg 
ttccctgggt ctctgcactg actcccccac 
agcagcagcc ttaccagtcc tccatgatca 
ccctgggcct ctctcccgcc cagcatcctc 
accccaccag ggtcccgctc acaccaggca 
ggcagagctg ggtcatgcag caccccatcc 
ccccaggcca gagcggccat cgcgtagaaa 
cccaccccat cctgtagaag tccattcccc 
gtcatggaac tcagggtact gggcctcaac 
ccttcccgga gctacagggg gatcctctag 
aggtcctgtg aggaagcctg gagcaagggt 
tggagctgag cccgtggccg ctcacaggtg 
tgcatgtgct gtgaatatcc caaggaactg 
gccctctcag tagacactgg agctgctctg 
gcaagcgcct gcctctcccc ttccggtgct 
gactcacctc tgtgccttgc tgctcctgag 
ctcgggccgc ccaggtcaca gagcctgagc 
caccaaggaa gaaagcagaa taaacatttt 
cgtgagccta aaaaaaaaaa aaaaaaaaaa 

<210> 68 

<211> 1808 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7500723CB1 

<400> 68 



cgcgcctcca cagtcccggc tctcccggtg 600 
tctctgcctt tttgtggtgt tcgtcgtctc 660 
gtcacagcga atgtggagga tagactacag 720 
ggatgatata aacggggaca ggatccaaga 780 
cagcaacaat ttcagccgat cctgtgtgga 840 
cacgctctgg gagagacctg tggcccaaga 900 
gccaagaggc agtgaggcac cttctgcctg 960 
tgcagtcaac ttgttcacag gggaaaccct 1020 
tgcgtccatc ctgagccctc tgctgcaggt 1080 
cctgctggtt ctcacccagg agcgggagga 1140 
cgggcaccag attggcctca gaggcagcct 1200 
tcacgtcacc aggacaggtg cccactacat 1260 
ctgctctgtg aagggtctct acgagaaggt 1320 
cccgcactgg gagagcatgc tcaatgccac 1380 
agcagtgcgc tacctgatgc atgtcccagg 1440 
ctcagaggcc ttcgtgctgc tggacgggca 1500 
agcccatgtc ctgagaaaac ccatcttcgg 1560 
cgttgaaaac ggaactggca ccgacagaca 1620 
cgtcctgtgt agcctagccc tcccgagcct 1680 
gaccgcagac caccgctcag ccttcttctt 1740 
cgagacggag accggggagg cccggcacag 1800 
cgtgctgctg gagctggcca atgtctctac 1860 
tgagccaagc cgccacgccg cctacatcct 1920 
cggcctggtc tctgtgatca agcacaaggt 1980 
ccgcctgggt gagggtgggc cagacagtga 2040 
gcggtaccag agtgaggcgt agaggcacgc 2100 
ctgacactaa acgtcctggg aagtgggccc 2160 
tcctgaccct ggtgatggtc gccactgggc 2220 
cacccaggga cctgcatggg tgaggggaca 2280 
cctgagtccc cacacagggc ctcactctgc 2340 
gccttcatag tggtctccct ggccaccttg 2400 
ttacccggtg ccctctcctt gccagcttct 2460 
gaaccagggt gtccccggga caggccgtcc 2520 
ttttccctcc tgtgctctgt cccccaagga 2580 
gggaacctga gacagctcca gcttcgcagc 2640 
catggggggt gtgacttggt tcctttgacc 2700 
ctcccccagc aggatgggtg gggcctgctc 2760 
tccttagtgg tgttgcagct gtctactggc 2820 
gctgtggaat gcgtgtttgg gtcagtctgt 2880 
tccctgaaga ggccccgtgc cccaggcatg 2940 
cacacgccca cgccgtgcca cccgatgcag 3000 
gcccaagggc agccatggtg ctctgtactg 3060 
ttcgtagcca aagcagcctg atgacccacc 3120 
tgcactgcct gaaaaacccc ggtggtcagg 3180 
aaaaaaaaaa aaatggtgcg gtc 3233 
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ggagcacggc gctggggccg cccgcagcgc 
tccccgcgcc ggccgcgtcc cgcccgctcc 
cggcgacatg ggcgtcccca cggccccgga 
cttcgctctc ttcctggctg cgtccctagg 
ctgccaacac cagccacgac ctggctcagc 
atggcaactt ctccatcacc atgcgcaacc 
gcctggtggt ggagatcagg caccaccact 
tgcaggtgca gacaggcaaa gatgcaccat 
aggagagtga aaacatcacg gctgcagccc 
tctgcctccc cctcatcctg ctcctggtct 
gtgcccagga gctggtgcgg atggacagca 
aagcctcacc acctgcccag gggatacccg 
tggcccagcg gcagccttct gagtctgggc 
tgtctcctcc aggccccgga gacgtcttct 
caaactttga ggtcatctag cccagctggg 
gcaggtgcat ttgagccagg gctggctctg 
ccctccctcc tgctctgggc tcagatactg 
ctctggatgc tacatgggga tgctggacgg 
ctgagattct cccctagaga cctgaaattc 
ttaagaagtc tcagaacgtc cagcccttca 
gatgtggcag catcagtggg acaagatgga 
agggcacggt ggagagactt ctcccccgtg 
ctgctcttct gtcagacttc ctctttgtac 
ccactggcca tcgccacctt ccccagctgc 
caacaggtta agtcaatctg gggcttccac 
gtcccgaaac gggaagtaca tattggggca 
ggcaatctga ggccaggaca gatgttgccc 
ggtggggcct tctgggaagg tgagtggaga 
ctactcccac tgctcagcgc gggccattgc 
gggacacttc tgagtatgaa gcgggatgct 
aaaaaaaa 



tcactcgctc gcactcagtc gcgggaggct 60 
ccggcaccag aagttcctct gcgcgtccga 120 
ggccggcagc tggcgctggg gatccctgct 180 
accttcacct gcaccatgga ggccaccagg 240 
gccacgggct ggagtcggcc tccgaccacc 300 
tgaccctgct ggatagcggc ctctactgct 360 
cggagcacag ggtccatggt gccatggagc 420 
ccaactgtgt ggtgtaccca tcctcctccc 480 
tggctacggg tgcctgcatc gtaggaatcc 540 
acaagcaaag gcaggcagcc tccaaccgcc 600 
acattcaagg gattgaaaac cccggctttg 660 
aggccaaagt caggcacccc ctgtcctatg 720 
ggcatctgct ttcggagccc agcacccccc 780 
tcccatccct ggaccctgtc cctgactctc 840 
ggacagtggg ctgttgtggc tgggtctggg 900 
tgagtggcct ccttggcctc ggccctggtt 960 
tgacatccca gaagcccagc ccctcaaccc 1020 
ctcagcccct gttccaagga ttttggggtg 1080 
accagctaca gatgccaaat gacttacatc 1140 
gcagctctcg ttctgagaca tgagccttgg 1200 
cactgggcca ccctcccagg caccagacac 1260 
gccgccttgg ctcccccgtt ttgcccgagg 1320 
cacagtggct ctggggccag gcctgcctgc 1380 
ctcctaccag cagtttctct gaagatctgt 1440 
tgcctgcatt ccagtcccca gagcttggtg 1500 
tggtggcctc cgtgagcaaa tggtgtcttg 1560 
cacccactgg agatggtgct gagggaggtg 1620 
ggggcacctg ccccccgccc tccccatccc 1680 
aagggtgcca cacaatgtct tgtccaccct 1740 
attaaaaact acatggggaa acaggtgcaa 1800 

1808 



<210> 69 
<211> 1672 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7500764CB1 



<400> 69 

cggccgcacc cggccggagc ggagggcaga 
ggagcgcggc gtgcgggagg gcccagagca 
ctgctcagat acctgttccc ggccctcctg 
agttcatcac aaggtcctac agattctacc 
attatggcag caaccggtgc acaaatccca 
gcctccgcca ggcctcctgg atcatccgag 
acgtccaggt gatctgccac acagaggcgg 
gcacatgccc gggcttcctc gcagacgggg 
tctggcgaga ggagaacggc tgtgagtgca 
ttcagctcat ccgggtggag aagcagtacc 
agctcttcct tggtgacatt cacactgatg 
gttaccagcc ccctctgcag aatgccaaga 
tcatctatcg gtcagacgag caccaccctc 
tcggcctgca cggggagtgg gtgagccagc 



gcgcgcgccc agttgcccgg gcaccaaatc 60 
ggactggaaa tgtcctggcc gcgccgcctc 120 
cttcacgctg tgaagtaagg tcaggcccag 180 
acaataacac cttcaaggcc taccaatttt 240 
cttatactct catcatccgg ggcaagatcc 300 
ggggcacgga agccgactac cagctgcaca 360 
tggccgagaa gctgggccag caggtgaacc 420 
gtccctgggt gcaggacgtg gcctatgacc 480 
ccaaggccgt gaactttgcc atgcatgaac 540 
ttcaccacaa cctcgaccac ctggtcgagg 600 
ccacccagag gatgttctac cggccctcca 660 
accacgacca tgcctgcatc gcctgtcgga 720 
ccatcctgcc cccaaaggca gacctgacca 780 
gctgtgaggt gcgccccgaa gtcctcttcc 840 
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tcacccgcca cttcatcttc 
actcagaccc ggtgtgcaag 
gcggcgtcct ctcgtccagg 
tgaaggtcac ccccatggat 
agtgcggggc cgagggctcc 
gctgcgtggc cctgggcatc 
aggatgcccg ggggcgctat 
cagacaggcc agagaagaga 
cttcgccgag ggcagaggac 
ctgggaggca cacctggtcc 
attggaacat ccgcagatag 
actattgaca gattttcttt 
gccagagaac tgtccttctt 
gcaaggagtg tttgaagttt 

<210> 70 
<211> 1347 
<212> UNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID Tk>i 7500772CB1 

<400> 70 

cactgcagcg caggtgagct ctcctgagga cctctctgtc agctcccctg attgtaggga 60 
ggatccagtg tggcaagaaa ctcctccagc ccagcaagca gctcaggatg ttcctgaagg 120 
ccgtggtcct gaccctggcc ctggtggctg tcgccgtgcc ctcttccagg acaaacttgg 180 
agaagtgaac acttacgcag gtgacctgca gaagaagctg gtgccctttg ccaccgagct 240 
gcatgaacgc ctggccaagg actcggagaa actgaaggag gagattggga aggagctgga 300 
ggagctgagg gcccggctgc tgccccatgc caatgaggtg agccagaaga tcggggacaa 360 
cctgcgagag cttcagcagc gcctggagcc ctacgcggac cagctgcgca cccaggtcaa 420 
cacgcaggcc gagcagctgc ggcgccagct gaccccctac gcacagcgca tggagagagt 480 
gctgcgggag aacgccgaca gcctgcaggc ctcgctgagg ccccacgccg acgagctcaa 540 
ggccaagatc gaccagaacg tggaggagct caagggacgc cttacgccct acgctgacga 600 
attcaaagtc aagattgacc agaccgtgga ggagctgcgc cgcagcctgg ctccctatgc 660 
tcaggacacg caggagaagc tcaaccacca gcttgagggc ctgaccttcc agatgaagaa 720 
gaacgccgag gagctcaagg ccaggatctc ggccagtgcc gaggagctgc ggcagaggct 780 
ggcgcccttg gccgaggacg tgcgtggcaa cctgaggggc aacaccgagg ggctgcagaa 840 
gtcactggca gagctgggtg ggcacctgga ccagcaggtg gaggagttcc gacgccgggt 900 
ggagccctac ggggaaaact tcaacaaagc cctggtgcag cagatggaac agctcaggca 960 
gaaactgggc ccccatgcgg gggacgtgga aggccacttg agcttcctgg agaaggacct 1020 
gagggacaag gtcaactcct tcttcagcac cttcaaggag aaagagagcc aggacaagtc 1080 
tctctccctc cctgagctgg agcaacagca ggaacagcag caggagcagc agcaggagca 1140 
ggtgcagatg ctggcccctt tggagagctg agctgcccct ggtgcactgg ccccaccctc 1200 
gtggacacct gccctgccct gccacctgtc tgtctgtctg tcccaaagaa gttctggtat 1260 
gaacttgagg acacatgtcc agtgggaggt gagaccacct ctcaatattc aataaagctg 1320 
ctgagaatct agcctcaaaa aaaaaaa 1347 

<210> 71 

<211> 729 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi bc_ feature 



catgacaaca acaacacctg ggagggccac tactaccact 900 
caccccacct tctccatcta cgcccggggc cgctacagcc 960 
gtcatgggag gcaccgagtt cgtgttcaaa gtgaatcaca 1020 
gcggccacag cctcactgct caacgtcttc aacgggaatg 1080 
tggcaggtgg gcatccagca ggatgtgacc cacaccaatg 1140 
aaactacctc acacggagta cgagatcttc aaaatggaac 1200 
ctgctgttca acggtcagag gcccagcgac gggtccagcc 1260 
gccacgtcct accagatgcc cttggtccag tgtgcctcct 1320 
ctcgcagaag acagtggaag cagcctgtat ggccgggccc 1380 
ctgctgctgg, ctgcacttgc ctgccttgtc cctctgctgc 1440 
aagttttaga aagttctatt tttccaaacc aggattcctt 1500 
accaaaagaa aagacattta ttcttttgat gcacttgaat 1560 
tttctcctct ccctccctcc cagcccctga gtcatgaaca 1620 
ctgctttgaa ctccgtccag cctgatccct gg 1672 
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<223> Incyte ID No: 7501350CB1 
<400> 71 

gctggtgctg gtacaggttg gggtcgctga tgaacgcgaa gatgatgcag gcaacgaagg 60 

tctccagcac cttcagcagc ccgggtacgg tggccatata gccagtgatc tcgccgggcg 120 

gcgtggcagg aagaggggga ctctgtggtc agggaacttc tcgctgagca cagctgcaca 180 

gtgctgtcag aacggccgat ctccagccca agatgattcc agcagtggtc ttgctcttac 240 

tccttttggt tgaacaagca gatccaagtg cgaaaggcag ctataaccag ctatgagaaa 300 

tcagatggtg tttacacggg cctgagcacc aggaaccagg agacttacga gactctgaag 360 

catgagaaac caccacagta gctttagaat agatgcggtc atattcttct ttggcttctg 420 

gttcttccag ccctcatggt tggcatcaca tatgcctgca tgccattaac accagctggc 480 

cctaccccta taatgatcct gtgtcctaaa ttaatataca ccagtggttc ctcctccctg 540 

ttaaagacta atgctcagat gctgtttacg gatatttata ttctagtctc actctcttgt 600 

cccacccttc ttctcttccc cattcccaac tccagctaaa atatgggaag ggagaacccc 660 

caataaaact gccatggact ggaaaaaaaa aaaaaaaaaa ttcctctggt cgaacttttc 720 
ccttacctt 729 



<210> 72 

<211> 1188 

<212> UNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7506396CB1 

<220> 

<221> unsure 

<222> (1) ... (1188) 

<223> a, t, c, g, or other 



<400> 72 

aggactgctt tatggacttg catcaggagt 
aggtggtgtg ggcagcaaag cggctgnctc 
gagnctcctg cttctgctgg tgttggccaa 
ctacagacaa gccattatgt ccttcaagaa 
tgtatcatgt tgaagaaagg aattcacacc 
tccttacgga agatgatggc ttcaacagat 
cttggtattc agaacgagtt ttaactgaaa 
taataagaac cattcaatac aacatgacta 
gtttggcagc tttagcaaat atgtcggcac 
agaggatcat cagtttattt tctttgctgt 
ccacacagtc cttgagaggt tcgctgagtt 
tctccttctt tagctcaagg ttgctgcaag 
tggaaatcat taagcaaggc gtcgttgcgc 
aattgaaatt caaatatgtg gaagaggagc 
ggtctcttgt ctacaactca gcagtcggcc 
tcaccatgga ttccgactga gggcaggatg 
gcccttcaag ttcttttatt tctgggtaac 
cttctgttaa agaggattgc acgagtgtgt 
gtgctagttg gcaatagata actcagcgta 
caacaataat aaatgtaaaa acctgctatt 



agcaacagga ctctggactg tcttcacact 60 
tccagagctt tcttcccctc tggccaacca 120 
tctgacagat gcctcaaatg cgccaaaccc 180 
cacacaaggt tttaccaatt cttgagattc 240 
atgtgtatat ggcccttata atattgttga 300 
ccattcatga agtgatacta aaaaatatta 360 
tctccttggg gagtctcctg atcctggtgg 420 
ggacacgaga caagtacctt cacacaaatt 480 
agtttcgttc tctccatcag tatgctgccc 540 
ctaaaaaaca caacaaagtt ctggaacaag 600 
ctaatgatgt tcctctacca gattatgtga 660 
ctggagctga gctgtcagtg gaacgggtcc 720 
tgcccaaaga cagactgaag aaatttccag 780 
agcccgagga gttttttatc ccctatgtct 840 
tgtactggaa tccacaggac atccagctgt 900 
ctctcccacc cggacccctc cagccaagca 960 
agaagtagac agacaggtta cttggtgtat 1020 
tttcctcaca cactttgatt tggagaattg 1080 
gatagtattg caaaaagggg aggaaataca 1140 
caaaaaaaaa aaaaaaaa 1188 



<210> 73 
<211> 1886 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7505917CB1 



<400> 73 

gggggcacct ctggtgacca agaccgggct 
ggggttgggg gggtccgaga ggagttgggt 
gtttcagccc ctctcggcgc gccgatacta 
tgccgctccc aaaccgcacg ataagcccca 
acggggccgg ggcggggcgg agtccggacg 
ccgaggcgcg cgcggcggcg cggccgctgc 
tgggctattg gcgcgcgggc cgggcgcggg 
gctgcccgtt cggcggcgcc ttcgccttgg 
tgcgcgcctt ccgcctgcgg gacgccgacg 
acacggatcc tgcgttcttc gagcaccttc 
gagccctgcc cgagggctcc ctcgccttcc 
cgctcctggt ggtgcagctg ctggagacac 
tggtggccac caacgcagcg cggcttcgct 
agatgggcct gaggcgggct cagggccccg 
acctgggcgg cttcgacagc agcagcaacg 
tggccgggac cctggcccac tccttcgtca 
acccgatgtt ggcgccagca gctggtgagg 
aggtgtggct ggagcaggtg tgtgcccacc 
gcgagcgggc agcctttgtg gcctatgcct 
tggacaccta cagcgtgtgg aggagtggtc 
tgggagagct gggctaccgg gcagtgggcg 
aggctcagga gatccgcaag gtcttccgag 
tggagtcagt cctcatcgta gtcagcaaca 
cccaggagct ggtggccgtg gggggccagc 
agcagacgtt gcctgggagc aaggctgctt 
tcatggacat gctgcagtta gcagaagagc 
tgtggcctcc aggggcccag gagccctgca 
tgcggctctg cctccagcag ggacagctgt 
gagccttggc ccagctgtcc ctgagccgac 
ctgcacagta ccaggtggtg ctgtccgaga 
cggggcagtc cccctgagac tcggagcggg 
tttccccaca aaaaaaaaaa aaaaaa 



gcgctccaaa gaggccgttg ggcctggagt 60 
gacatccccc accccatccc gggtccagct 120 
ttagccccac ccgtcctcca tcgagtcccg 180 
cagggagtgc gccataggcc ggggcgcgtc 240 
tcgggagcag gatggcggcg gagcaggacc 300 
tcactgacct ctaccaggcc accatggcgt 360 
acgccgccga gttcgagctc ttcttccgcc 420 
ccgccggctt gcgcgactgt gtgcgcttcc 480 
tgcagttcct ggcctcggtg ctgcccccag 540 
gggccctcga ctgctccgag gtgacggtgc 600 
ccggagtgcc gctcctgcag gtgtccgggc 660 
cgctgctctg cctggtcagc tacgccagcc 720 
tgatcgcagg gccagagaag cggctgctag 780 
atgggggcct gacagcctcc acctacagct 840 
tgctagcggg ccagctgcga ggtgtgccgg 900 
cttccttttc aggcagcgag gtgccccctg 960 
gccctggggt ggacctggcg gccaaagccc 1020 
tggggctggg ggtgcaggag ccgcatccag 1080 
tggcttttcc ccgggccttc cagggcctcc 1140 
tccccaactt cctagcagtc gccttggccc 1200 
tgaggctgga cagtggtgac ctgctacagc 1260 
ctgctgcagc ccagttccag gtgccctggc 1320 
acattgacga ggaggcgctg gcccgactgg 1380 
cacgaatgaa gctgaccgag gaccccgaga 1440 
tccggctcct gggctctgac gggtctccac 1500 
cagtgccaca ggctgggcag gagctgaggg 1560 
ccgtgaggcc agcccaggtg gagccactac 1620 
gtgagccgct cccatccctg gcagagtcta 1680 
tcagccctga gcacaggcgg ctgcggagcc 1740 
ggctgcaggc cctggtgaac agtctgtgtg 1800 
gctgactgga aacaacacga atcactcact 1860 

1886 



<210> 74 
<211> 1067 
<212> UNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7500701CB1 



<400> 74 

ctcgagccga attcggctcg agcggcgctc 
caccgcttcc tgcgcggcct cttaacgctg 
caccgcatgc tcggtccaga gtccgtcccg 
gcaccgcccc gaatcgggac gcacaatggc 
gcactgcttc gcctcctgcc ggagtaccgg 
gaaaaactcg cttcctgtga catcgtggtg 



ctgcctccct gcagggagct gcttatggga 60 

ctgctgccgc cgccacccct gtatacccgg 120 

cccccaaaac gatcccgcag caaactcatg 180 

accttccact gcgacgaggc actggcatgc 240 

gatgcagaga ttgtgcggac ccgggatccc 300 

gacgtggggg gcgagtacga ccctcggaga 360 
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caccgatatg accatcacca gaggcagggt tcaagcgtgc aatggatctg gttcaagagg 420 
agtttctgca gagattagat ttctaccaac acagctggct gccagcccgg gccttggtgg 480 
aagaggccct tgcccagcga ttccaggtgg acccaagtgg agagattgtg gaactggcga 540 
aaggtgcatg tccctggaag gagcatctct accacctgga atctgggctg tcccctccag 600 
tggccatctt ctttgttatc tacactgacc aggctggaca gtggcgaata cagtgtgtgc 660 
ccaaggagcc ccactcattc caaagccggc tgcccctgcc agagccatgg cggggtcttc 720 
gggacgaggc cctggaccag gtcagtggga tccctggctg catcttcgtc catgcaagcg 780 
gcttcattgg cggtcaccgc acccgagagg gtgccttgag catggcccgt gccaccttgg 840 
cccagcgctc atacctccca caaatctcct agtctaataa aaccttccat ctcatactga 900 
caaaaaaaaa atggtagctc ggggggcgtt aggatagatc tttagacccg gtggggtttc 960 
taagctggag aagtgttaga aaaaaagggc ggccgcccat caaattagtt ctctgccccg 1020 
ggattttttt tcgggccggt acccttgggg ggaccaaatt tcccctt 1067 

<210> 75 
<211> 1220 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7500702CB1 

<400> 75 

ctcgagccga attcggctcg agcggcgctc ctgcctccct gcagggagct gcttatggga 60 
caccgcttcc tgcgcggcct cttaacgctg ctgctgccgc cgccacccct gtatacccgg 120 
caccgcatgc tcggtccaga gtccgtcccg cccccaaaac gatcccgcag caaactcatg 180 
gcaccgcccc gaatcgggac gcacaatggc accttccact gcgacgaggc actggcatgc 240 
gcactgcttc gcctcctgcc ggagtaccgg gatgcagaga ttgtgcggac ccgggatccc 300 
gaaaaactcg cttcctgtga catcgtggtg gacgtggggg gcgagtacga ccctcggaga 360 
caccgatatg accatcacca gagatgtatg agaactttgt ggaggaggtg gatgctgtgg 420 
acaatgggat ctcccagtgg gcagaggggg agcctcgata tgcactgacc actaccctga 480 
gtgcacgagt tgctcgactt aatcctacct ggaaccaccc cgaccaagac actgaggcag 540 
ggttcaagcg tgcaatggat ctggttcaag aggagtttct gcagagatta gatttctacc 600 
aacacagctg gctgccagcc cgggccttgg tggaagaggc ccttgcccag cgattccagg 660 
tggacccaag tggagagatt gtggaactgg cgaaaggtgc atgtccctgg aaggagcatc 720 
tctaccacct ggaatctggg ctgtcccctc cagtggccat cttctttgtt atctacactg 780 
accaggctgg acagtggcga atacagtgtg tgcccaagga gccccactca ttccaaagcc 840 
ggctgcccct gccagagcca tggcggggtc ttcgggacga ggccctggac caggtcagtg 900 
ggatccctgg ctgcatcttc gtccatgcaa gcggcttcat tggcggtcac cgcacccgag 960 
agggtgcctt gagcatggcc cgtgccacct tggcccagcg ctcatacctc ccacaaatct 1020 
cctagtctaa taaaaccttc catctcatac tgacaaaaaa aaaatggtag ctcggggggc 1080 
gttaggatag atctttagac ccggtggggt ttctaagctg gagaagtgtt agaaaaaaag 1140 
ggcggccgcc catcaaatta gttctctgcc ccgggatttt ttttcgggcc ggtacccttg 1200 
gggggaccaa atttcccctt 1220 

<210> 76 
<211> 3280 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<223> Incyte ID No: 6044343CB1 

<400> 76 

gcggcggcgc ctggatgccg gcccggggct gggccggaga ctggcgcccg ccccgaagcg 60 
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actttcggtc cccgacccgc tccgcctgac 
tcttgctgtc gacatctttg acaagttcaa 
cgccattcat gaagaattcc ccagggagct 
cttcaaacca cctgaagaaa aagaaggccc 
cccgcagacc acacgcccag gcctggcacc 
cgacaccctg atgacactgg ctagttcacc 
gggcagctcc cgcctgagcc ctatgacccc 
cccatcatta atacgtcgat ggtgagcgca 
agacccctgg agaggcccgt cctggtggag 
aagcctgtct gtgtgttctg gaaccattcc 
atcttcatgt gagggctttg ccaagacaca 
agatcgtccc acggggacag tctcttgtgg 
gcagtgagcc tcctgctctg cttctgtggg 
cagacgcgca caggcctgca cgtggaaaca 
gacgcacaca gagctgcacg tggagacacg 
cacacaggtg cacgcggacc tgcacgtgga 
cacaggtgag cgtggacctg tgtgaacaga 
catgcaggcg cacaaagaga tccctcaagg 
gggactcccg agatgcagtg gccaacacta 
cagaggcctg ttctggtcac acatgggagg 
tctgctgaca ctgggaactc ccagctacgt 
tgaccttagt cttgcccgag aggccgacac 
aagaggccac accttgccac agcactgttt 
tggcgaaggc aggaggtggt gcacgagagt 
aaggcgtttc gtacttagaa aaatgcaaaa 
ggattcaccc atgtatgaga cagtggattc 
agccagtagt gtttgtgctt tggttttgct 
gacttacttg agttgctctg tctttacagg 
cacttttgct gtctctgttt agaagaaaac 
agcacaatct gggaggggaa gacctgctct 
aggggagaca ctgtgccgga ctcagggagt 
tgctgaacgt tgtgcatctg ggggcgcccg 
cctctgggca cagtggagag ccatgagtgg 
tgagcccacg ccaggagcca gttcagccca 
caagttctag ttcataagca accagcacag 
aaaagagaga agaaaggaac ccctttcacc 
cgggcattca tcttttttcc ggcaattagc 
tgttgcacct atgatttcca gattgtaaag 
catttacaga ttagcaaact aaggatctga 
gggcagggcc ctgtcagaag ccgagacaca 
catcagtgag gcaggagtca taatggggcc 
gtgccacctg aagatggaag tgtcagagga 
cctcagaaag agacagagga gtgcatcgca 
agcagcgggg cacgtggtgg acgcaagacg 
cgtagaactt gaacttcaga gggtgctcct 
gaagggagac tgttcccatg gtcctgcctc 
gggtggctgt tcggtgcagg gggccggtga 
gccgggcttt cagcactcac tatacctgct 
cccagggccg ggcaaagact gcccccaaca 
gcctcttctc ccttctctac gcagtgtcag 
caagctcctg tccaggaacc ggacccatgt 
cgaggtgctc atggacgtct ccaggtgtga 
gctgagccta cccccaaacc tgcccctcct 
aacgaagcaa atcaaccaat taaaaaaata 

<210> 77 



ccctatgatg aagagggcgt ccactggagt 120 
cttcacggga gccagagtcc catggttcga 180 
ggagtcctcc atctcctt.cc cagccaactt 240 
cctggtaagg ccagccagcc ggaggaccac 300 
gagggggagg ccccgatcag caggtgaaag 360 
ttcactctgg tcatcgttta ctgtaccccg 420 
aaccgccgca gcctctggct gcctcactgg 480 
ctggtgtaca gcgagggggc tccacttccg 540 
ttcaccctgc tggagatgga gaagcgaacc 600 
ctggcgtaaa tgtgatgccg ggagttctgc 660 
gagaacccgc agggttgttg gctcagcggc 720 
tctgtgtctc ccccttcagc cccctgtggg 780 
tgcatcagag cccccccaga tgcccacacg 840 
cagacatgcg cacacgcagg cccacacaca 900 
tgcgcacacg caggcccaca cgcaggccca 960 
gacacagatg tgtgcacatg caggcccaca 1020 
tgcgcgcaga cctgcacgtg gagacacaga 1080 
ctgacaccca ggaaggggcc tctcacttgt 1140 
gtgtagtgcc cagttcactg ctggataaga 1200 
cagaacccag agactgggcc caggagccct 1260 
gtgggggtcc ccataccaga caaaggtccc 1320 
agcccagctt tggggtctgg ctttacccac 1380 
atctggcctg tttcagaagc accgccagag 1440 
ctacgttcta gcatccattc aagtgaggga 1500 
ttaatgattc tcacccacac atgagactgt 1560 
agagccatgc tgtggctctc atctgctggg 1620 
tttttagcta catggtttat gtgaggtgcg 1680 
gagcttgctt tttctctgct ggtttgtttt 1740 
attagaactg gggtgtggtg gaaggatggg 1800 
ctctccacat accaggaggg agtggcagga 1860 
caagagcgtg agcaccccca ggcttcaatc 1920 
tgcagcagga acactctccc catcacaatt 1980 
gggctgtcct gggggctgcg ctatggtgag 2040 
gcctcattct tctcctgctt ctttgctttc 2100 
tacctggaat gtaagaggcg cttggcttat 2160 
acgagggacg tcaggcccag ctctctgtcc 2220 
ttcagctcta atgacctgct gtctgcttta 2280 
tttgtggcaa atgtattaac gttcacatcc 2340 
gaggttatgt gattttccca gtctggcgga 2400 
cctcgtaaca gcttgttatt tggggcttcc 2460 
ctggcttctg ggcaattccc ataaaacatt 2520 
aggcgagatg ttatttctga aagcatctac 2580 
gctgtgtgaa cccggggtgg tggtggttcc 2640 
gccccgtttg agttttgctg ctaggtgaca 2700 
tggaggccct gatgagagag acacagaagg 2760 
tgcggcagcg agcctgctgc tcttgcagct 2820 
ggaacacggt ggggagcagc ctgcgagggc 2880 
gtttggtttg gccaggatgg cctgggagtc 2940 
gacatgggcg gagggaaccg tctcacccct 3000 
tgggacggga gggtggtctg cccggggctt 3060 
cgtacgccag tgcagccaca cggccagctt 3120 
gatgggcatc ttcttgtggc cactatgccc 3180 
caaactaaat cttccattct aagcactttg 3240 
tattcccaga 3280 
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<211> 5567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7503990CB1 

<400> 77 

attcgcgcgc gccttcccta gccacccggg 
gcgccccttg cgggaagtga gggtgggttg 
cttcgggttt cgcctctcgg ctccctctgg 
gaagtccgcc tgagacttgg gtcaaggagt 
aatccggtga gcggtaagga aagtgatgcc 
agcaagaaca catccactcc agagatacct 
atacttggta accgagggaa ttactaagac 
atcctgacac tatgaatgct acttggatgc 
aggggccgtg gagctggcct cggcctcggc 
gtgctgaatg gctgcgatgg cgcccgctct 
ccggttcaaa ctggctcccc catcctctac 
caacgccaac atccttattg ctgccaacgg 
tcccagccta gatttccgaa ataatcctac 
ggtggcatct tatctctgct ctgatgtaac 
gcaaggggtc ttcagcaagc agacagtcct 
tgaactccga gctgagctgt tggggagaca 
tagaaccatg aatacgagtg gtcagacagc 
taagaaattg actaaaagtt caacacattc 
gggaaaacgg gctctcactt catctgctct 
tggggacttg aaggggggta tgaccaattg 
acacacaatt ttgtatagca ataatagcac 
acagccggca cttcaaggaa gcagtagatt 
ggggggtgtc aaattggagg gtaaaaagtc 
agattctgac acaaggataa cagctttact 
ccgcagatta caaaagcgct tacaggttgt 
acatcagctg ggtggatttt tggagaagac 
gagaccacgg agccagttga tgctgactcg 
cagtgagacc accacttcag agggacttag 
agaattggag agatttacag ctagtggcat 
tgattcagat gtcactgaca gtagttcagg 
gaccagagct gatcccgagc agcgtcatgt 
ggctgcagac cgggcagcta ttgtcagccg 
cttggaatat cgaattcgtc agcaaacaga 
tgagtctgtt tctcagccat tggaaaacca 
gtcactgtct accaaatcat gtggagcact 
tcagcctgtc ttggcagacc acattccagg 
taagaagcaa cgactgaatc tcgtctcttc 
gacacgtcct gtactgagct gtaagaagcg 
tctttccaag aaggttcacc ggaacagcac 
ctgcgcactg tgtggttcag gcagcatcaa 
ccctctgttg gaacgtcttt cccagttgga 
agatgatgtt cccacaagcc tgcatttcca 
gccttttgac aaaatcaaac ctcccaaaaa 
gggcagtctg ccagattcag ctcgtaagga 
aacagccatg ttgaagcatc acacagacat 
ccatcctcca cacagtccct tggtgcgaca 
cgccagctct agctcacagg ttacagccag 



gttgcctcct aacatggaat ggccaaagga 60 
ggactgggtc cgcgttgggg gaggtgcaat 120 
ctctggagtt gggacccctt gtgggctctg 180 
caaactgtcg ccccccgctc ctcccccaga 240 
aagtcttcga agcctcagtg acaaacgcat 300 
tctcgaaaca aaagattttc ctacctgctt 360 
ttcttgctca tttctgagta ttgtctttat 420 
ctcttaagtc tgttctctgg ggaggcagta 480 
atcgggagag gctggacttc ctgtctctct 540 
cactgacgca gcagctgaag cacaccatat 600 
cttgtcccct ggcagtgccg aaaataacgg 660 
aaccaaaaga aaagccattg ctgcagagga 720 
caaggaagac ttgggaaagc tgcaaccact 780 
atctgttccc tcaaaggagt ctttgaagtt 840 
taaatctcat cctctcttat ctcagtccta 900 
gccagttttg gagttttcct tagaaaatct 960 
tctgccacaa gcacctgtaa atgggttggc 1020 
tgatcatgac aattccactt ccctcaatgg 1080 
tcatgggggt gaaatgggag gatctgaatc 1140 
cactcttcca catagaagcc ttgatgtaga 1200 
tgcaaacaaa tcctctgtca attccatgga 1260 
atcacctggt acagactcca gctctaactt 1320 
tcccctgtct tccattcttt tcagtgcttt 1380 
gcggcgacag gctgacattg agagccgtgc 1440 
gcaagccaag caggttgaga ggcatataca 1500 
tttgagcaaa ctgccaaact tggaatcctt 1560 
aaaggctgaa gctgccttga gaaaagctgc 1620 
caactttctg aaaagcaatt caatttcaga 1680 
agccaacttg aggtgcagtg aacaggcatt 1740 
aggggagtct gatattgaag aggaagaact 1800 
acccctgaga cgcaggtcag aatggaaatg 1860 
ctggaactgg cttcaggctc atgtttctga 1920 
catttacaaa cagatacgtg ctaataagat 1980 
tggtgcccct attattggtc atatttcaga 2040 
cagacctgtc aatggagtta ttaacactct 2100 
tgacagctct gatgctgagg aacaattaca 2160 
atcatctgat ggcacctgtg tggcagcccg 2220 
gaggcttgtt cgacccaaca gcatcgttcc 2280 
aatccgccct ggctgtgatg tgaatccctc 2340 
caccatgcct cccgaaattc actatgaagc 2400 
ctcttgtgtt catcctgttc tagcatttcc 2460 
gagcatgctg aaatctcagt ggcagaacaa 2520 
gttatcgctt aagcacagag cacccatgcc 2580 
caggcacaaa ttggtcagct ccttcctaac 2640 
gagcagttcg agctacttgg cagccaccca 2700 
gctctccacc tcctcagatt cccctgcacc 2760 
cacatcgcag cagccagtaa ggaggagaag 2820 
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gggagagagc tcattcgata ttaacaacat 
tcgcgtagag aaactgcaat acaaggaaat 
tcagtctctg aaggggagtc ctgatgagga 
agccttcgcc gccctgcatg ccaaatgtga 
cacgagtgtg ccaccccagc ggcggggcag 
aaccccccag ctgggcagtg ccaacccctc 
cagtagccac tctttgtcag aatactccca 
ggaactgcac tcagcacccc tcacccctgt 
tgaggatacc cgttgttcca caccagagct 
ggagcggcgg accttccccc tggcgcacag 
tgcacaggag cgagcagccc gctgcactcg 
gacagaggca gcgcccacct cgcctcccat 
agcagccaca gctcagcgcc cgactcacag 
cactaactat tggcattaaa gcttcagaaa 
ccggaaattt tcacagtttt tagtgaactt 
ttttcttgtt ttaatttttg ttttgttttt 
cacttcctcc cgttggcaaa cagaagttca 
acatcctctg cactgttgaa tcactggact 
acacctgtgg gcagggcaga acagcctggc 
tagggtggaa tggggcaggg tgctctggac 
ggctctgctg tggccctcct ggctgggtcc 
cagagccfcct ctgccataac tgccagctgc 
tttcctttct cacccagaac ttaagagact 
tctgttttga ctctttcact gcagtaacct 
gccccttggt acccttccct gtctgctctc 
ttccacttgg aatattttta ccacctatcc 
aacctctatt cccactcccc tgcaaactgg 
gtgtgatttt tttttttttc cctctgattc 
cctgtcggtg ttagttccac tcttggtctt 
tggtcctagc tgtggttccc attgttcccc 
tctcctgtat gatcacatca ccaaaaaggg 
cattttgtaa tcgtataaaa atagtagaca 
tttcaacatt agtgattttt ttttctgttt 
aaaaaaaaaa tacaataatg caccatatcc 
ggttgaaaat tctgaataac ccttttagca 
aaaaaaaaac cttgtatttt gtaaatattt 
agcgaaacat gtagctgtct ttgttctata 
tgttaatgtc ttgattctgt tcgcaaagca 
aaaaattgaa aaaaataaaa aaaaaagtta 
gccccgaatt cgaaaccatg taaaagctgt 
ttccacacaa cttaggaccc ggaaccgcaa 
cctacccccg ttaattggct ttgcgctcac 
cccagcgcct ttaaggatcc gcccaacgcg 
tttccgtttc cgggcccact aacgcgctgc 
cagctcactc aaagggggta aaagggtttt 
acttttggcc caaagcccag ctaaaggccg 

<210> 78 

<2U> 1263 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7504655CB1 



tgtcatccca atgtctgttg ctgcaacaac 2880 
ccttacgccc agctggcggg aggttgatct 2940 
gaatgaagag attgaggacc tatccgatgc 3000 
ggagatggag agggcacggt ggctgtggac 3060 
caggtcctac aggtcatcag acggccggac 3120 
caccccccag cctgcctccc ctgatgtcag 3180 
tggtcagtcc cctaggagcc ccattagccc 3240 
ggctcgggac actctgcgac acttagccag 3300 
ggggctggat gaacagtctg tccagccctg 3360 
tccccaggcg gagtgtgagg accagctgga 3420 
acgcacctca ggcagcaaga ctggccggga 3480 
tgtccccctc aagagtcggc atctggtggc 3540 
atgagcggga gacagccatc taaacagact 3600 
tctctgcgtt tgatattcaa acatcatatg 3660 
aaggaattta gatcctactt tggtattttt 3720 
gtttccatgt tttcttgtca cacacctgag 3780 
ggatgagacc ctgctggcct ggtcctgggc 3840 
tactgatctt agatgaccac cccctccctc 3900 
gggctacagt ttagcatggc cttcttgagc 3960 
tcttaccccc tcccctccca tctgtggctt 4020 
ccttggtttt tcgtgctgga acatccccac 4080 
tctccccgag tgctcagctg gcagaacacc 4140 
gattttttgt ttcatctgca tttggtcttc 4200 
ggctgtggct gctcaggttc ccctcctcat 4260 
ccatgccatg tacacaccca caacccgtcc 4320 
tgatctttga aggtagggtt aggactactt 4380 
gggttgtggg aagtgagcag ccatctccct 4440 
actttgccat gtttccttca catccagatc 4500 
tcacgctccc cttgcctgtg gaacattgtc 4560 
cttcaccctt ctctgttaac cttgtgcctg 4620 
ggagggggga gaagactctt tttttttggc 4680 
actgcttaat ggttggggtt ttttcacaat 4740 
gcaagttaaa gggtttgtca ttgtttcttt 4800 
ctatgcataa agtgcttctt ctatttataa 4860 
ttgaaaaaaa aaacaaaaac aaaaaatgga 4920 
tcttttcctg ctttggagct gtgtaatggc 4980 
gaaatgcttt tcttcagaga agctgatctt 5040 
cagactagtg cttaaaaaaa aaaaagaagg 5100 
cagaaaaaaa aaaaaggggg gggggtaccc 5160 
tcccggggga aattttttac ccgcccaaat 5220 
gggttaaacc ctggggtccc aaatgggtga 5280 
tgcccgtttt ccagtgggga accctgtgtt 5340 
gggggatacg cgttttcctt ttggggcgcc 5400 
cccgggtctt tcggcttcgg cggggggtat 5460 
ccaaagattc gggggttaac cctggaaaga 5520 
ggacccttaa aaaggcc 5567 
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<400> 78 

ttcagggcgt atcggtaaca gcaccctacc 
cttaaacaca accaggaaga aaaaaaatca 
gggtcttatt ttagaaatat accttctagg 
acccggatca aggaaccact gtgacatttt 
aggagttcag gattccaatc ctaatttcca 
ccagccccac cccggggcgc aggaggcgcg 
tcggggaggc tctgggcgcg gggcgggact 
cgccgccctg gcccggcctt gccttgcgct 
ggcggcggct gcggccgctg gcagcgctgg 
ccacagcccg ggccgggcag acaccgcgcc 
tcaccgagga ggagctggcc cgctatggcg 
ctacaatgcc ttgacgggga aggactccac 
tgcagacctc acccatgaca ctacgggtct 
ggtcttcacc aaagtgtaca aagccaaata 
tctcaatgag gatggcagcc ctaacctgga 
catcaaggat gagttctgat gttccccctg 
gaagacacta ggtgctgaat ctcctgcaaa 
gatctgaata aaacagatgc ttaccctgga 
agtgaagagc ccctgaaaag atccttgagg 
cagccatgcg atgactcctg ttcgccccca 
tcttttttga ccccgtgaac atgcgggaat 
cca 



ctttgcaaaa atctccttgg gaaccagaaa 60 
gccaaaaata aaagcgaatt aagacagttg 120 
ttctggtatg ttgggctctg tcatctttta 180 
ttggtaactt ccagtctgta gtttggattt 240 
ggttccagct gcgtggcccg agtagcccgg 300 
cgaagggcgg gagttctcgg gtccccgccc 360 
ccgggacgct gggcgggcgt gctagggccc 420 
gcgcgctcac catggtgggc cccgcgccgc 480 
ccctggtcct ggcgctggcc ccggggctgc 540 
ctgccgagcg ggggccccca gtgcggcttt 600 
gggaggagag ttttatggac gaggagcccc 660 
tagaggggta gccaagatgt ccttggatcc 720 
cacggccaag gaactggagg ccctggatga 780 
ccccatcgtc ggctacactg cccggagaat 840 
cttcaagcct gaagaccagc cccattttga 900 
caggagcagg ttcttgggag cgtgaggcag 960 
actggctgcc tggaggccct gagccaccca 1020 
aaaaaaaaaa aaaaaaaagc cccaaacttt 1080 
accactttgg gcctattcag aagaggtaca 1140 
ggtggggaac ttgctctgta tctgaatagc 1200 
tcagaagact cgagatgcta tggtccctgt 1260 

1263 



<210> 79 
<211> 1854 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_ f eature 

<223> Incyte ID No: 7504690CB1 



<400> 79 

cggctcgagg ctgagaccca gagtcaccca 
gaagtgggct gtgacagatc aggaaacaga 
ccctacagct aacaagcggc agaggcagga 
ctgtggccac tgctgctgct gctgctgctg 
cggtcccagg atgccaggct gtcctggctt 
gccctggtct gggcagccac ctggcagcgg 
caccagagcc agcagcaggc cctgaggtgg 
tccctcagaa ggagcacaga cataagcacc 
agccagaccc agcaggaaga cagtgggcgg 
agagcagcat tctggattcc tctgcgggct 
tgagggggct gaggaatctg tcagaggaaa 
aagcctctcc ccgctacaag tccctgcggt 
acctggtggg gcagggagcc ttccgagcac 
cccccttccc ccctgcgatg ccccgggtcc 
aggagagggt ggtgtcctga gtcaagtcct 
cctcgcccct ccctctggga cctctccgga 
tgtgtcacct gacatttgcc catgagagcc 
gaccggttct gaagtatggg cctccggggt 
cccatgtccc ggcatgaagg acactgctag 
tatcacagcc caaagaggtt ctctggtggc 
caagctagaa gtgttggggg gttaatgtcc 
gcaggacctc agcttcattg tccttgagca 



ggggtctccg tcacgtgcca ggagtaggca 60 
gctcagtgca gcccactaaa ttgctcaggg 120 
tctgcactca ggagctgctt ggagatgctg 180 
ctgccaacat tggccctgct caggcagcag 240 
gctggcctcc agcaccgagt ggcatggggg 300 
cggaggctgg agcagagcac gctccatgtg 360 
tgtctacagg gagcccagcg cccccactgt 420 
ttccggaatc atctccctct gaccaaggcc 480 
gggccaagct gctggaccat ggctgtgtgg 540 
ctgctcccca ctacgaggtg tttgtggcgc 600 
atcgagacaa gctggaccac tgccttcagg 660 
tctggggcag cgtgggccct gccagagtcc 720 
tccgggcagc cctcgctgcc tgcccctcct 780 
ttcggcacag gcacctggcc cagtgtctgc 840 
gccccaccgc ccagctcccc ccagaggcca 900 
tggggagtcc ttggccaggg tctctgactc 960 
gctgggcctt agagaggcct tggcccagct 1020 
tagcagatgc cagcagtgcc tgcccgtgtc 1080 
agagttacca tgcacaccga tggtttcctg 1140 
cacagctgtg tgctcagtca gtgcactggg 1200 
ccaggagcag caaccctgag tcaataagga 1260 
ggacaattct gaagtgtatt ctacataaac 1320 
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tctcagagga tgcccagcag gatggagtcc 
atgtacttcc tgcgggggct ctcccttccc 
ctgggatggc tcccaaataa acctcttgca 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 
gctctagggg ttccaggttt aggtacgggt 
cccctaattt catttcacgg gcggtggttt 
ttacccaatt taatcgcttt gaggaaattc 
gggcccgcac ggttggcctt tccaaaaatt 
ctgtaggggg gttttaagcc ggcggttttg 

<210> 80 
<211> 1332 
<212> DNA 

<213> Homo sapiens 



cagttgcccg cagcagtaac ccactcattc 1380 
tctcttcccc actcccccgc cttgggcttc 1440 
cccagaaaaa aaaaaaaaaa aaaaaaaaaa 1500 
aaaaaaaaaa aaaaaaaaaa agggggggcc 1560 
gcatgggagg tcatagctct tctaaggtgt 1620 
taaaaggtcg tgactgggaa aaccctgggg 1680 
cccttttggc aagttggggt aatagcgaag 1740 
tgggccctct gattggcgat tgggacgcgc 1800 
gtggttacgc cggtgacccg ttac 1854 



<220> 

<221> misc__ feature 

<223> Incyte ID No: 7504720CB1 

<400> 80 

cggctcgagg cctggtgacc agaagtttgg 
ccaggctgac cctgctgacc ctcctgctgc 
caaaacaaac ctggagagca tcctctctta 
cctgaagggc ttcacgacca aaggtgtcac 
cctggccata agggacacct ttgtgaatgc 
agtcctaagc aacaacagtg acgccaactt 
caccaacaac aagatcagcc ggctgctaga 
cctcaatgct atctacctga gtgccaagtg 
aatggaaccc tttcacttca aaaactcagt 
gtaccctgtg gcccatttca ttgaccaaac 
ctcccacaat ctgagtttgg tgatcctggt 
catggaacag gctctcagcc cttctgtttt 
caagttccag cccactctcc taacactacc 
gctctcaatc atggagaaat tggaattctt 
gctgacagag gacccagatc ttcaggtttc 
gacagagact ggggtggagg cggctgcagc 
ggtctttgaa gtgcagcagc ccttcctctt 
tgtcttcatg gggcgagtat atgaccccag 
gagcgctacc tctccagcct cagctctcag 
ggcccctgcc acctcctgcc tcaggtgtcc 
tgggcaaggg acctgcttct attagccctt 
ctttttgcag ctttctctag ttcaagttca 
atgaaaaaaa aa 



agtccgctga cgtcgccgcc cagatggcct 60 
tgctgctggc tggggggctg gggagaacac 120 
ccccaaggac ttcacctgtg tccaccaggc 180 
ctcagtctct cagatcttcc acagcccaga 240 
ctctcggacc ctgtacagca gcagccccag 300 
ggagctcatc aacacctggg tggccaagaa 360 
cagtctgccc tccgataccc gccttgtcct 420 
gaagacaaca tttgatccca agaaaaccag 480 
tataaaagtg cccatgatga atagcaagaa 540 
tttgaaagcc aaggtggggc agctgcagct 600 
accccagaac ctgaaacatc gtcttgaaga 660 
caaggccatc atggagaaac tggagatgtc 720 
ccgcatcaaa gtgacgacca gccaggatat 780 
cgatttttct tatgacctta acctgtgtgg 840 
tgcgatgcag caccagacag tgctggaact 900 
ctccgccatc tctgtggccc gcaccctgct 960 
cgtgctctgg gaccagcagc acaagttccc 1020 
ggcctgagac ctgcaggatc aggttagggc 1080 
ttgcagccct gctgctgcct gcctggactt 1140 
gctatccacc aaaagggctc cctgagggtc 1200 
ctccatggcc ctgccatgct ctccaaacca 1260 
ccagactcta taaataaaac ctgacagacc 1320 

1332 



<210> 81 
<211> 1435 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504722CB1 



<220> 

<221> unsure 

<222> (1) ... (1435) 
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<223> a, t, c, g, or other 
<400> 81 

tgcctgcctg agtcacgtgt cagggggaag ctggaaggcg tcgttctcct ttcccagctc 60 
tcctgcctgt ccgccatgtt ttcaggccgg gtctggcttg gtcttccccc gtaaggaaat 120 
ggccggggag ctccagggga cccaggcgcc gtcgcttcgg cggagcctgg gctgaccagc 180 
caggacagcg gggtaaaccc gaacaattct gcgcgaggta gggaggccat ggcgtccggc 240 
agtaactggc tctccggggt gaatgtcgtg ctggtgatgg cctacgggag cctggtgttt 300 
gtactgctat ttatttttgt gaagaggcaa atcatgcgct ttgcaatgaa atctcgaagg 360 
ggacctcatg tccctgtggg acacaatgcc cccaaggttg ctacaactat ctgtatagga 420 
tgaaagctct ggatgccatt cgtacctctg agatcccatt tcattctgaa ggccggcatc 480 
cccgttcctt aatgggcaag aatttccgct cctacctgct ggatctgcga aacactagta 540 
cgcctttcaa gggtgtacgc aaagcactca ttgataccct tttggatggc tatgaaacag 600 
cccgctatgg gacaggggtc tttggccaga atgagtacct acgctatcag gaggccctga 660 
gtgagctggc cactgcggtt aaagcacgaa ttgggagctc tcagcgacat caccagtcag 720 
cagccaaaga cctaactcag tcccctgagg tctccccaac aaccatccag gtgacatacc 780 
tcccctccag tcagaagagt aaacgtgcca agcacttcct tgaattgaag agctttaagg 840 
ataactataa cacattggag agtactctgt gacggagctg aaggactctt gccgtagatt 900 
aagccagtca gttgcaatgt gcaagacagg ctgcttgccg ggccgccctc ggaacatctg 960 
gcccagcagg cccagactgt atccatccaa gttcccgttg tatccagagt tcttagagct 1020 
tgtgtctaaa gggtaattcc ccaacccttc cttatgagca tttttagaac attggctaag 1080 
actattttcc cccagtagcg cttttttctg gatttgcatt caggtgttat tcttaatgtt 1140 
tctgtcaaag cttcttaaaa atcttcactt ggtttcagcc atagttcacc ttccctgttc 1200 
caggtttatt taattccaaa ggtgagagtt ggagtgagat gtcttccata tctatacctt 1260 
tgtgcacagt tgaatgggaa ctgtttgggt ttagggcatc ttagagttga ttgatggaaa 1320 
aagcagacag gaactggtgg gaggtcaagt ggggaagttg gtgaatgtgg aataacttac 1380 
ctttgtgctc cactttccag cacactgcgc cgnttaactc gtgagtccgg ctcgg 1435 

<210> 82 
<211> 554 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7504733CB1 

<400> 82 

ggaggcaaga tgtgtgagca gcatccaaag aggcctgggc ttcagttgtg gagagggaga 60 
gagccaggtt ggaatgggca gcaggtaggg agatccctgg ggaggagctg aagcccattt 120 
ggcttcagtg tcccccaaac ccccaccacc ctcttcctag gccgccctcc ccactgttac 180 
caacatgaag ctgctcgcag caactgtgct actcctcacc atctgcagcc ttgaagtact 240 
tccagaccgt gactgactat ggcaaggacc tgatggagaa ggtcaagagc ccagagcttc 300 
aggccgaggc caagtcttac tttgaaaagt caaaggagca gctgacaccc ctgatcaaga 360 
aggctggaac ggaactggtt aacttcttga gctatttcgt ggaacttgga acacagcctg 420 
ccacccagtg aagtgtccag accattgtct tccaacccca gctggcctct agaacaccca 480 
ctggccagtc ctagagctcc tgtccctacc cactctttgc tacaataaat gctgaatgaa 540 
tccaaaaaaa aaaa 554 

<210> 83 

<211> 2365 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_ feature 
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<223> Incyte ID No: 7507100CB1 
<400> 83 

agactctgcg cacgcgcagt tccccctccc gcccctgctt gccggtgatg gcgcataacg 60 

catgcgcggg gagggcggag ctgggcgttg ccgtggctac tgggaacgca tttcacgggg 120 

gcggggcgtg gttccggggc ggggcgcggc cgccggaagt gcgtggccgc ccggggccat 180 

ggcgacactc agcttcgtct tcctgctgct gggggcagtg tcctggcctc cggcttctgc 240 

ctccggccag gagttctggc ccggacaatc ggcggccgat attctgtcgg gggcggcttc 300 

ccgcagacgg tatcttctgt atgacgtcaa ccccccggaa ggcttcaacc tgcgcaggga 360 

tgtctatatc cgaatcgcct ctctcctgaa gactctgctg aagacggagg agtgggtgct 420 

tgtcctgcct ccatggggcc gcctctatca ctggcagagt cctgacatcc accaggtccg 480 

gattccctgg tctgagtttt ttgatcttcc aagtctcaat aaaaacatcc ccgtcatcga 540 

gtatgagcag ttcatcgcag aatctggtgg gccctttatt gaccaggttt acgtcctgca 600 

aagttacgca gaggggtgga aagaagggac ctgggaagag aaggtggacg agcggccgtg 660 

tattgatcag ctcctgtact cccaggacaa gcacgagtac tacaggtgcc tgctgaggct 720 

gctccccctg ccgcagggct cagcctccat cgtggcgccc ctgctgctga gaaacacatc 780 

agcccggtcc gtgatgttag acagagccga gaacctactt cacgaccact atggagggaa 840 

agaatactgg gatacccgtc gcagcatggt gtttgccagg cacctgcggg aggtgggaga 900 

cgagttcagg agcagacatc tcaactccac ggacgacgca gacaggatcc ccttccagga 960 

ggactggatg aagatgaagg tcaagctggg ctccgcgcta gggggcccct acctgggagt 1020 

ccacctgaga agaaaagatt tcatctgggg tcacagacag gatgtaccca gtctggaagg 1080 

ggccggagga acgggcaggc cgctggcctg aggtctggag aaacccctgg agaagggtgt 1140 

ccccaccagc ccatacagcg tgtgtgtgga gggggccttg acctccgtga tgtctactgt 1200 

gcctcaggat aaggacccgc catgccctgg ctagacagtg tgtggttttt tattggcacc 1260 

tcagtctcaa cattttcttt tcggattcat gaggaaagag aaatcctggg gttggacccc 1320 

aagacgacgt acaacaggtt ctgcggagac caagagaagg cgtgtgagca acccacccac 1380 

tggaagatca cctactgagg aggatcctcc agggccgctc cccggacccg acaggcgcgg 1440 

gtggatgcag gttctgtcgc cgtggagtca ccgtctactg ccagccggga gctgggcgga 1500 

caggaccgtc cctcgcaggg tcccaggccc agaagaggcc ccacgcctct agagctgggc 1560 

tccgtcctcg gcgttgccag ccgccatggc tgatgaagag gctccgctgc tctcgggggt 1620 

ggcggttgtt ttcaggcagc gtctgtgaac ccacagctcg gttgccagca gtgcccgcgt 1680 

ggtgacccag aagcaggagt gtttgtcagg ctcccgctct ggcctttcca gccacctttc 1740 

atgtcttcat attttaagtg cattgaggat agatgcaggc gggtgagctg ccctccgtca 1800 

ggtggacccg ggctgacatt tccctgggag ctggtgcaag gagaagcgtc attttaaatg 1860 

tctgcagagc gaccaggggc ctcatgaatc tctccgttgc cctccgcgca gcaggaggct 1920 

gcctgtgtgt ttcctcctgg gatgcgtgca aggcagacct ggtgctgcaa aggaaagggc 1980 

ctgaggcctc agggagcccc gtggagggat gacagttcag gccctactgc tggcacgtca 2040 

gagcactggg aagtttttca gtgacgtctc tggggcactc agtggattgt ctgtaggaaa 2100 

cttgcagctc tgctcctcac accaggcccg gctggccacc caccctcgcc cccactggcc 2160 

acccctccct cgccccgact gccccgcccc accctcaccc cgactgcccc gccctcgccc 2220 

ggctggccgt ccctgccctc gccccggctg gcaggtgcac atggggcctc caggtctgcc 2280 

attcgctatt gagaactaga aatgaggaag gacagttacg ctaactccaa aaggctgtct 2340 

aggatgagct gctttatcag ggagc 2365 

<210> 84 

<211> 703 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 7503330CB1 

<400> 84 

gccgaaggcc agctggagcg tcggcgctgc ggggccgggg gggtcgaatg ttcgtggcat 60 

cagagagaaa gatgagagct caccaggtgc tcaccttcct cctgctcttc gtgatcacct 120 
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cggtggcctc tgaacacgcc 
gccttcgagg aggacgtgca 
gatgaacaca atgcagctct 
cccagtggca gcttggggaa 
actgagatgg ccgtcgtgct 
ggaagacacc tttggtgaaa 
tgcctccctg gctgtgtctt 
cctcacagcc aggaaatttg 
gacgaactga aaagctaaca 
aataccagac caacctcaat 



caactacttc gacacgtcgc 
gctgccgcgg gcctatatgg 
ccgaacagca ggatttccca 
aagacccagc gctccgttta 
caacggtggg accatcccaa 
gactttaagt tccagagaat 
tcttgaggga gaaatcggta 
gaaatcctag ccaaggggat 
ccgactgccc gcccctcccc 
ccccgcaaac taaagcaaag 



agcccaggat gcgggagacg 180 
agaacaaggc cttctccatg 240 
acggcagctt gggaaaaaga 300 
gaagcaacgt gtatcagcca 360 
ctgctccgcc aagtcacaca 420 
cagaatttct cttaccgatt 480 
acagttgccg aaccaggccg 540 
ttcgtgtaaa tgtgaacact 600 
tgccacacac acagacacgt 660 
eta 703 



<210> 85 

<211> 3871 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504519CB1 



<400> 85 

cccacgcgtc eggcagctga gaeggcageg 
gaggagactc tgeacaggge atggatcact 
tgactttgea gaatgeaaca acagagacat 
tgcaggtgtc caggggcegg agctcagttt 
gggtcagcac gecatgeagt tccccgccga 
cagggagctg cggctcatct gtatctactt 
caactcatct ctgetgaata actacgtcct 
caacctcagg gatcctgtga acatcagctt 
cctgacctgt gtcttctgga aggagggagc 
tgagggctgt egtacagage agccctccca 
cacctacttt gctgttctca tgcaactctc 
acctcttacg tacatctccc tcgtgggctg 
agtcctgctg cacttccatt tcaggaagca 
cctgcatgcc tccgtgctgc tcctgaacat 
gtctcctgtg cccgggtcag catgcacggc 
cagctgcctc acctggatgg ccatcgaggg 
ctacaacatc tacatccgca gatatgtgtt 
agccctcctg gtgctgcttt ccctctctgt 
ccccgtcttc gacagctggg agaatggcac 
gcggagcccc gtggtgcaca gtgtcctggt 
caacctggtg gtgctggcct gggcgctgtg 
tgcaccaagt gtcagggect gccatgacac 
gggaaccacc tgggccttgg ccttcttttc 
cctcttcacc atcttaaact cgctctacgg 
gcggtgccgc tcagaagcag aggecaagge 
aacacagtag tccgggcctc ctggcctgga 
gaggctaegg ctcctgctag agagggtggc 
tgaccgccaa ggggcctttt ccacttccac 
gctctacctc tccctgacat tttgetcegg 
ctttgtcctg gtacctgggc ccagctcgcc 
teaggaagee aagtttcaag gactgtcttt 
cttctcacag accctaggta tccacagctg 
ctaacccagg agcttagtaa aaattgeata 
gggaattccc gcggcctcca cctgcttgct 
aagcacctgc ttggccctgc caccttcctc 



gcagcttctc agggceggag ccagttcttg 60 
gtggtgccct tttcctgtgc ctgtgccttc 120 
gggaagaact cctgagctac atggagaata 180 
tttcctctcg gcaggaggtc agcatgcccg 240 
gctgacccgg gacgcctgca agacccgccc 300 
ctccaacacc cactttttca aggatgaaaa 360 
gggggcccag ctgagtcatg ggcacgtgaa 420 
ctggcacaac caaagectgg aaggctacac 480 
caggaaacag ccctgggggg gctggagccc 540 
ctctcaggtg ctctgccgct gcaaccacct 600 
cccagccctg gtccctgcag agttgctggc 660 
cagcatctcc atcgtggcct cgctgatcac 720 
gagtgactcc ttaacacgea tccacatgaa 780 
cgccttcctg ctgagccccg cattegcaat 840 
tctggccgct gccctgcact acgcgctgct 900 
cttcaacctc tacctcctcc tcgggcgtgt 960 
caagcttggt gtgetagget ggggggcccc 1020 
caagagctcg gtataeggae cctgcacaat 1080 
aggcttccag aacatgtcca tatgctgggt 1140 
catgggctac ggcggcctca cgtccctctt 1200 
gaccctgcgc aggctgeggg agegggegga 1260 
tgtcactgtg ctgggcctca ccgtgctgct 1320 
ttttggcgtc ttcctgctgc cccagctgtt 1380 
tttcttcctt ttcctgtggt tctgctccca 1440 
acagatagag gccttcagct cctcccaaac 1500 
atcctcagcc tctctggccg ccagtagcct 1560 
aggectgetg ctggacccca gaggecactg 1620 
ggcctctcca ggcactgagg ggaaggcatt 1680 
ggcagatcca accttacctg gggcagcaaa 1740 
agggatgtgg gcagagcacc agectgggea 1800 
gagtctgtct gtatgacctt gggcctgcca 1860 
tgacatgggg gcaagcagct ttgtttcagc 1920 
agaccagggg gaagagtgtc agcgtggggt 1980 
aggggcagga tctcattcag gctgccctgg 2040 
caggggaggg ccagatggca tcctggcttg 2100 
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gggcgggtgg gacctacccc aggctctgag actttactgg cctatgcctg aggcctcttt 2160 

tcctttaact ccctaaatta tgatgactcc aagtccaagc ccacccttcc caaagattgg 2220 

gaggttccgc cgttcccaga ggctcctcct gcggtgctcc caagacttcc atagaccatc 2280 

tggaccagta gcccatcccg cagttttctt gggggcagag gaaaacgctt ctttctcctc 2340 

cagctgaatc agctggatcc cagtgtcctg gctgtttggt gattgggcaa gattgaattt 2400 

gcccaggtag gcgtgagagt gtgggtttta aattcgaagc tcaggccata gtttcagaga 2460 

atcaccctta ccccagacct tcatgagaca gtgctcatga agccagtgcg tttcccagaa 2520 

cgaacactag gcggcaccgt tggtccacac tcagaggccc ttggcgccaa gactgcatct 2580 

agaatcgctc aaacacctgt ttgcagaccc catgcaccag ctggaggggc cgtaactgca 2640 

ggactgcgcc tactgagtga cccatttcct ccaggaggaa aggcaagaca cgcttacacg 2700 

gccatttgtc tcttttccca atgcggcggt gcactttcgc tcttgggggc tgcaccccag 2760 

acatagctgg caccagagca gggtgctcag gtggtgggtg ctcagggccc tgccccaggc 2820 

cactgggccg ttttgatgac ctcgaaggtc acaggcagaa aataggagca ggatttcccc 2880 

tggggaaaag ttctcctggg acatcttctg ctcttctgta catttctaga tgcaaataac 2940 

tccttcacca ggcagtgagt ggcgtaggct ctggagccag gctgcctggg ctccaatgcc 3000 

agctctgcca cttgctagct gtgagactgt ggacaaacca ctcagcctct gtgtgcctca 3060 

gttttcctat ttgtaaaata gaggccatag tggtacctat tttgaagact aagtaaaaga 3120 

attcaaataa agagacttgg cacagagtaa gtgctcagta aaaaaaaaaa aaaaaaaaaa 3180 

aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaac acaagagggg 3240 

ggcgggggcg cgagcgcgaa aatagggcag ggtgcagccc gcgggcggga ataaatacta 3300 

cggcgcgcgc gttagctagt gggaggcgca gagaagaaaa ggaggggacg caccgcccga 3360 

tagagaaaaa gtaagaggaa agaaacaact accaggaggt agggcaaccg ggggaggggt 3420 

ggaaccagga cagccaggca gaaagaacac tctgggagga gaggcaacaa cacaataata 3480 

gacagccgct gtgaagaaaa cgcggcgggt agtataccca ccccaaatgt ggtgtgtcaa 3540 

cccccgcggg gagagcgccc accggtgggg gacaacactc ggggaaagaa acaactcgcc 3600 

gtgagaaaag ggtggtgaga ggatccaaaa agcggtacac agtgtgcatg aacgcacacg 3660 

aagatagggg agaaggaaca accccgcgcc ggacaggaaa agcggggcgc ggcacacagg 3720 

agcgggatca gcgcccaagt gaaaagtgac gagggcgcgc caaaagtggt aagcgcacca 3780 

accgcgggca cgttggggtc gggaacaacg cgtacgacga agggcaacag tgcacaccgc 3840 

gatcgctgag ttgaggacag acgacactga t 3871 



<210> 86 
<211> 695 
<212> UNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID Nor 7504705CB1 



<400> 86 

ccgggactcc aacatgcaat 
aaatatttga agccgaggat 
acacaagacc ctgagactga 
atcctggccg ccttagcggt 
tatgaactta gatccgagaa 
atgactacag actttgcgaa 
gctacttcag gaagcgccga 
tttctggagg ctggcacctg 
gcttatatac aatgcttctt 
cccccagatt gataagtaat 
tgtgattgtt ccataataaa 
aaaaaaaaaa aaaaaaaaaa 



gattgcccca gatctctgta 
acaggaagtt cttaaaggag 
cctgcaggac gaaaccatga 
agtaactttg tgttatgaat 
cgctctaagc ctgtccacga 
cgctacgcca tggtttatgg 
gggaccaaat gagactgagg 
attttgtatc cccctgtagc 
tcctgtatat tctcttgtct 
gaaagtgcac tgcagtgagg 
cttctggtgt gataaaaaaa 
aaaaaaaaaa aaaaa 



ctacacctga ccactttctg 60 
cctctctccc tactgctgct 120 
agagcctgat ccttcttgcc 180 
cacatgaaag catggaatct 240 
gctcaatagg gaagcctgtg 300 
atacaatgct gcctataatc 360 
gaagaaaaaa aatctctttt 420 
agcattactg aaatacatag 480 
ggctgcaccc ctttttcccg 540 
gtcaaaggag agtcaacata 600 
aaaaaaaaaa aaaaaaaaaa 660 

695 



<210> 87 
<211> 536 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504738CB1 



<400> 87 

gctaggttct gaagcttctg agttctgcag 
aataccatga agctctgcgt gactgtcctg 
tctccagcgc tctcagcacc aaattccaaa 
ccagtgaatc ctgggtccag gagtacgtgt 
gacaggaagt cttcagggaa ggtcacctga 
ctccatactc aggactcctc tccgcagttc 
tgtgccgtgt tattgtatta ggtgtcattt 
aagtgtcccc tatggggatg gtccactgtc 
taacacattt gattctgtgt gttttcataa 



cctcacctct gagaaaacct ctttgccacc 60 
tctctcctca tgctagtagc tgccttctgc 120 
ccaaaagaag caagcaagtc tgtgctgatc 180 
atgacctgga actgaactga gctgctcaga 240 
gcccggatgc ttctccatga gacacatctc 300 
ctgtcccttc tcttaattta atctttttta 360 
ccattattta tattagttta gccaaaggat 420 
actgtttctc tgctgttgca aatacatgga 480 
taaaacttta aaataaaatg caaaaa 536 



<210> 88 

<211> 549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7510280CB1 



<400> 88 

agccgagctg ggcgagaagt aggggagggc 
cttcgcagaa cctactcagg cagccagctg 
gtctgcagac gcgatggata acgtgcagcc 
tgtgaaaggc cacgtgaaga tgctgcggct 
agctttattg tgtgttttta ggatattatc 
atcgtatctg tgttggcact gataccagaa 
tttgcacttg tgacagcagt atgctgtctt 
ctgttcaatc ccagcggtcc ttaccagaaa 
taattttata ttacttttta gtttgatact 
aaaaaaaaa 



ggtgctccgc cgcggtggcg gttgctatcg 60 
agaagagttg agggaaagtg ctgctgctgg 120 
gaaaataaaa catcgcccct tctgcttcag 180 
gcattgcttg gtagtcttca tgtaaggctc 240 
aactcactgg taacaacagt attcatgctc 300 
accacaacat tgacagttgg tggaggggtg 360 
gccgacgggg cccttattta ccggaagctt 420 
aagcctgtgc atgaaaaaaa agaagttttg 480 
aagtattaaa catatttctg tattcttcca 540 

549 



<210> 89 

<211> 1888 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mi sc_f eature 

<223> Incyte ID No: 7503700CB1 



<400> 89 

gcctgtgcgc tgtgcctccg cttgtctgct 
cctttcgcgt cctctgtcct tgtgtcctct 
tgctccttct gtctcgcgcg ctccttggtc 
agctgtcttc cttttctctt gttccgtcgc 
tctgtgcctg tctctggctt tcttggtttt 
gccgggctca ccccagcccc gcccggaggc 
ggccacactc tgccctactc cttccctccg 



ctcccccgcc ttggccttct ctcgccgccc 60 
ctctcccgtc tcgctctcgt gtccttggac 120 
gctccctccg tgttgctctg ttgtgtggtt 180 
tctttgcttc ttttcctttg gttcgttgtg 240 
tgtgcccagg cccccacgtt ggccgctgct 300 
gccccgcggc cccggctagc cagggcgggc 360 
cgttccgggc ctcggagccg ccttgaggag 420 



67/79 



WO 03/068943 



PCT/US03/04712 



gatgagtccc tggagctggt tcctgctgca 
ttcgcggcgc ggggcgcccg gcaccgccaa 
gaattccttc ttgtggacca ttaagcgaga 
tgtcccgtac acccgagttt gggacttcat 
gagcagcatt gtgtactttg agttggatct 
cagctgtcag atgctgccac agggcgagaa 
ctgccgcctc aagcgccacc tggagtatgt 
agaccagcgc ggcaaggggc tctacgcaga 
ggagcgcaag aggcctgtct gggtgatgct 
taagtcccgt ggagtgcctg tcttagacct 
gaaacagact ggggcagtgg aaaaggtgga 
cttttcacag gtcatttcat gggcaacaac 
tatgaggtag aacacgcccc tgctggacga 
tccacacggc ccactctgtc caccatcttt 
gcaccagaag ccgtatcctc agggcactca 
agtgccgaca cgcccagtga ggccgaacag 
cggaggccgc gactccggca attcagcgat 
gtcccgcaac tccaggtccc tgtcctggac 
cgccgtgggc attcccacca cagccagatg 
actcctgtgt tctgggtgct ggtgctggct 
ctggaagcac caggctaaga acctgacccc 
ccacattctg gtctagcctt gttgggccca 
cccagtgttg attcttctct ttccaaggaa 
aggtgtacag ttttgtaaca tagtgagttg 
aaaataaata cctttttttg tataaaaa 



gaccctctgc ctcctgccca cgggcgcagc 480 
ctgcgagctc aagccccaac aaagcgagct 540 
cccaccatct tacttctttg gcacaatcca 600 
ccccgacaac tctaaggagg ctttcctgca 660 
cacagacccc tataccatct cagctctcac 720 
cctccaagat gtgctcccca gggacatcta 780 
caagctcatg atgcccttgt ggatgacccc 840 
ctacctcttc aatgctattg ccggaaactg 900 
catggtcaac tccctgactg aagtggacat 960 
gttccttgcc caggaggctg agcggctgag 1020 
agagcagtgc catccattga atgggttgaa 1080 
acagtgctgg atgttttgcg gcgtgaaggc 1140 
cccatccaca aagggaagag taaaaagacc 1200 
gctccaaaag tccctaccct ggaagtaccg 1260 
acgctgcctc cccttgtgtc ccggcctgga 1320 
aggttccgga agaagcggag gcggtcacag 1380 
ctgtgggtcc gcctggagga gagtgacata 1440 
aggcacatct ccactgaact gcggctccct 1500 
gtggccagca gtgcctgcct gtctctctgg 1560 
ttccaaacag agacacccct cctgtaacga 1620 
tcggacttga agaatggcca ttcctgtact 1680 
atccagaaga gactgctctt gaaaaagcgg 1740 
tgtgactttg ggttatccaa ctttgggggc 1800 
tgtgaaaata aattataaat gagttgtatg 1860 

1888 



<210> 90 

<211> 1050 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7504685CB1 



<400> 90 

tacggggcgc ctgtgcttga ggttgggggt 
gttggccgcg gcctctgagc tgggatgagc 
ccggagccat ggccagtaca gtggtagcag 
caggccgtta cgttttgcaa gccatgaagc 
aaagcctacc aaaatctccc tactgccaat 
attatgcttt taaatcatcc tgacaaagga 
gaagctaaag atttactaga aggtcaagct 
aagttcgtat tagtttatgt atatgagtac 
acaattttaa aaaatgattt agcacaagct 
tttaaatttg gggattttaa atcagattat 
caagcacaca ccttctgaat agaaatattg 
cctatgtcta cttttcaagg caaagtgaag 
ctttaatggg agactgataa actggtaatt 
gcttaagctg tttaagtagg gactcttctt 
caactggcct ggcaacttgg atacttggaa 
ctgagaccta agaaattgac ctaggggtgt 
ccatttttat ttcatgcatt aaaaatagta 
aatcacagtt cattgcagcc tcaacctcct 



tgcgtcgctc tctggtaaag gcgtgcaggt 60 
cgtgctcccg gtggaagcaa gggagcccag 120 
ttggactgac cattgctgct gcaggatttg 180 
atatggagcc tcaagtaaaa caagtttttc 240 
aaagggaaaa taagagatgc tcatcgacga 300 
ggatctcctt atatagcagc caaaatcaat 360 
aaaaaatgaa gtaaatgtat gatgaatttt 420 
taagttttta taataaaatg cctcagagct 480 
aaatctcaaa gccttggtat aattttcttg 540 
agtttagaat atttgcatat taattatggg 600 
t teat tact c atttagcaga taatttggga 660 
atgacagtcc ttgctctcag ggagccccca 720 
agactgtgat aaatagtatg atggaaatta 780 
attcggtgga aaggctgttc caggtacagg 840 
ccttgtattt aaaagtgaat ttaaccacaa 900 
gtgtgtgtgt gtattctatg tacatataaa 960 
tgataaagat ttcagagtac aggtctggta 1020 

1050 



<210> 91 
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<211> 899 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 7506844CB1 

<400> 91 

cgtgaactgg gcagctgtga cacgcgcacc ttgtcacctt gtcaccaatt tcctgcccgg 60 
gatcccagag ctacccttct tggagcccac aggggcagcc cttggctggt ccctgcgagc 120 
ccgtggagac tgccagagat gtcctctttc ggttacagga ccctgactgt ggccctcttc 180 
accctgatct gctgtccagg agtcaatgaa ttccaacgtc agcgtgtacc agcctccaag 240 
gcaggtcatc ctgacactgc aacccacttt ggtggctgtg ggcaagtcct tcaccattga 300 
gtgcagggtg cccaccgtgg agcccctgga cagcctcacc ctcttcctgt tccgtggcaa 360 
tgagactctg cactatgaga ccttcgggaa ggcagcccct gctccgcagg aggccacagc 420 
cacattcaac agcacggctg acagagagga tggccaccgc aacttctcct gcctggctgt 480 
gctggacttg atgtctcgcg gtggcaacat ctttcacaaa cactcagccc cgaagatgtt 540 
ggagatctat gagcctgtgt cggacagcca gatggtcatc atagtcacgg tggtgtcggt 600 
gttgctgtcc ctgttcgtga catctgtcct gctctgcttc atcttcggcc agcacttgcg 660 
ccagcagcgg atgggcacct acggggtgcg agcggcttgg aggaggctgc cccaggcctt 720 
ccggccatag caaccatgag tggcatggcc accaccacgg tggtcactgg aactcagtgt 780 
gactcctcag ggttgaggtc cagccctggc tgaaggactg tgacaggcag cagagacttg 840 
ggacattgcc ttttctagcc cgaatacaaa cacctggact taaaaaaaaa aaaaaaaaa 899 

<210> 92 

<211> 2529 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 7510259CB1 

<400> 92 

gttcatgatg tacgcacctg ttgaattttc agaagctgaa ttctcacgag ctgaatatca 60 
aagaaagcag caattttggg actcagtacg gctagctctt ttcacattag caattgtagc 120 
aatcatagga attgcaattg gtattgttac tcattttgtt gttgaggatg ataagtcttt 180 
ctattacctt gcctctttta aagtcacaaa tatcaaatat aaagaaaatt atggcataag 240 
atcttcaaga gagtttatag aaaggagtca tcagattgaa agaatgagat gactaacaag 300 
tgagggtgtc agagtcacat ggcagaagtc atttcaagtc atgtcaaatg gtctagtgat 360 
gacataatca tttaagaaat caacaaattc tatcaatact ggatgagatg ccatatttga 420 
gatgagtgca gcaaacaaga tgtctaggat atttcgacat tcttctgtag gcggtcgatt 480 
tatcaaatct catgttatca aattaagtcc agatgaacaa ggtgtggata ttcttatagt 540 
gctcatattt cgatacccat ctactgatag tgctgaacaa atcaagaaaa aaattgaaaa 600 
ggctttatat caaagtttga agaccaaaca attgtctttg accataaaca aaccatcatt 660 
tagactcaca cgctgtggaa taaggatgac atcttcaaac atgccattac cagcatcctc 720 
ttctactcaa agaattgtcc aaggaaggga aacagctatg gaaggggaat ggccatggca 780 
ggccagcctc cage tea tag ggtcaggeca tcagtgtgga gccagcctca tcagtaacac 840 
atggctgctc acagcagctc actgettttg gaaaaataaa gacccaactc aatggattgc 900 
tacttttggt gcaactataa caccacccgc agtgaaacga aatgtgagga aaattattct 960 
tcatgagaat taccatagag aaacaaatga aaatgacatt gctttggttc agctctctac 1020 
tggagttgag ttttcaaata tagtccagag agtttgcctc ccagactcat ctataaagtt 1080 
gccacctaaa acaagtgtgt tegtcacagg atttggatcc attgtagatg atggacctat 1140 
acaaaataca etteggcaag ccagagtgga aaccataagc actgatgtgt gtaacagaaa 1200 
ggatgtgtat gatggcctga taactccagg aatgttatgt gctggattca tggaaggaaa 1260 
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aatagatgca tgtaagggag attctggtgg 
gtacattgta ggtatagtaa gttggggaca 
ctacaccaga gtaactaagt atcgagattg 
attgtccatg agttatacac atggcacaca 
ttaaattcat ttactttgga ttagtgcttt 
gacaaatcta atatcctgag gtggccttta 
agggaagaag acacagcaaa tgacagacag 
ttgtgatact tcctaataag ataaataagt 
ttttccacag gatatgaaga gctgccagta 
gagcatgtga gattcttcta gtgaaaaaga 
cattctagaa ctgataagtg gaccttcagt 
ttatgacttg aactgggctt atatctaata 
tgacatttta aaagttttta aatgtatctg 
agcactagct tcaggaagca tgttgtgttg 
agcatcttgc catctatatg ttagtagcag 
gactctttgg aacgtttttc catgagcaca 
gagaattggt ttcaaataat tagtaacagg 
tatccaatta tttacttaga aatatttata 
tcttcagctt tctgacaagt cagatgtcca 
tatcagagct gatttgtgta aagctttgtg 
ctacgaacta aataaatctt gttatatgga 
cgcccaaga 

<210> 93 
<211> 2268 
<212> UNA 

<213> Homo sapiens 



acctctggtt tatgataatc atgacatctg 1320 
atcgtgtgca cttcccaaaa aacctggagt 1380 
gattgcctca aagactggta tgtagtgtgg 1440 
gagctggtac tcctgcgtat tttgtattgt 1500 
tgctagatgt caagaagccc ttcagaccca 1560 
catacgtagg accaaacccc ctctaccatg 1620 
cacctattcc ttactcacaa gggaaactgc 1680 
ggtttccctc aattgaagac aggaacatca 1740 
atgccaaaat cttacctcat ataatacctg 1800 
acagtcttcc ctgaagactc agggcttcaa 1860 
gtgcaagaat ggagaagcat gggatttgca 1920 
atacagagca ctatcactaa cctcaacagt 1980 
aacttgctgt taacacagtg ttataactca 2040 
ttaagaagct tttctgattt attctttaac 2100 
ttggcccaga aaggacgaaa aaaagattaa 2160 
ggaggataaa aagaagcaga tgaaggctag 2220 
acaagcacgc taatttttga tggaatgagt 2280 
tcagtatatg gcaactggta cttttgtaag 2340 
tcagagtatc aggtcaggtg tctatcagaa 2400 
taaagcacgt aggacagtgc cttgcatata 2460 
aatcaaaaaa aaaaaaaaaa aaaagggggc 2520 

2529 



<220> 

<221> misc_feature 

<223> Incyte ID No: 7510444CB1 



<400> 93 

cctacctctt cctaggccca cagccagtgc 
ttagcccggc gcagggcgcg cagcccaggc 
tggttgggca gcgggtgctg cttctagtgg 
aggctgccaa aatcctgaca atatctacac 
acaggcgcgt cccggatccc cgcactgcca 
tatccaggcg attttccagg ctgggcttgc 
tagccgtggg gcagagaggg gcctctafctg 
ccgggtgtct cagattcttc aagagcatgg 
aaagtttttg atcccagata ttaaagagga 
ttcacctgaa gatcatcaaa aaagaattaa 
attggatggc agaaaagaat ctgaagccct 
atgtagttat ttgctaagca gaaaggatat 
tctggtattt gttgaagcat ttgatttctg 
accatttgtg gccattcttc ccaccacatt 
cttgtcttat gttccagtat tcccttcctt 
agtgaagaat tttctgatgt tctttagttt 
atttgacaac accatcaagg agcatttccc 
tctactgaaa gcagagttgt ggtttgttaa 
cctgcttccc aacactgttt atattggagg 
acaagacttg gacaacttca ttgccaactt 
tggctccatg ttgaacaccc atcagtccca 
tgcccacctc cctcaaggag tgatatggac 
tcatttggcc acaaatgtga aaattgtgga 



ctttggagta ctgaggcgcg cacagagtcc 60 
tgagatccgc tgcttctgtg gaagtgagca 120 
ccttccttct ttctggggtc ctgctctcag 180 
tgggtgagtg cttggccgga gaattcccag 240 
gggctccagc gaacggcgat tgatcagagt 300 
ggacctggct ggaggaggga gaagcccatc 360 
ctgaggtgga agccattacc tactgttgga 420 
tcataatgtg actatgcttc atcagagtgg 480 
ggaaaaatca taccaagtta tcaggtggtt 540 
gaagcatttt gatagctaca tagaaacagc 600 
tgtaaagcta atggaaatat ttgggactca 660 
aatggattcc ttaaagaatg agaactatga 720 
ttctttcctg attgctgaga agcttgtgaa 780 
cggctctttg gattttgggc taccaagccc 840 
gctgactgat cacatggact tctggggccg 900 
ctccaggagc caatgggaca tgcagtctac 960 
agaaggctct aggccagttt tgtctcatct 1020 
ctctgatttt gcctttgatt ttgcccggcc 1080 
cttgatggaa aaacctatta aaccagtacc 1140 
tggggatgca gggtttgtcc ttgtggcctt 1200 
ggaagtcctc aagaagatgc acaatgcctt 1260 
atgtcagagt tctcattggc ccagagatgt 1320 
ctggcttcct cagagtgacc tcctggctca 1380 
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ccccagcatc cgtctttttg tcactcatgg tgggcagaac agcgtaatgg aggccatccg 
tcatggtgtg cccatggtgg gattaccagt caatggagac cagcatggaa acatggtccg 
agtagtagcc aaaaattatg gtgtctctat ccggttgaat caggtcacag ccgacacact 
gacacttaca atgaaacaag tcatagaaga caagaggtac aagtcggcag tggtggcagc 
cagtgtcatc ctgcactctc agcccctgag ccccgcacag cggctggtgg gctggatcga 
ccacatcctc cagactgggg gagcgacgca cctcaagccc tatgccttcc agcagccttg 
gcatgagcag tacctcattg atgtctttgt gtttctgctg gggctcactc tgggcactat 
gtggctttgt gggaagctgc tgggtgtggt ggccaggtgg ctgcgtgggg ccaggaaggt 
gaagaagaca tgaggctagg tgtagccttg ggtgagggga gggcatccct ggtcctttga 
aggttctccc caccccagca cacgccaccc ctctgttctc tcttcagctc cacctgccac 
tgatcctgca acttgcttct ttctattctc tgcctctgtt tagaaatctt cacgcaccac 
tgaggcttct tgacttgccc cttgtgactt gaaaccccag ctcagataca aattttcacc 
tgccagccct gcctcctcct ttctcccttt tcctagacac aggactctga caacttcatc 
ctccttgttt agatgacttc ccagtttcca gtccccattt ctccttctat cacttttcat 
aaaaaaactc aggaaatatt tgacatatct tccatttcaa attcttcc 

<210> 94 
<211> 1819 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7510494CB1 

<400> 94 

ctcttctttg cctctgctgg agtccgggga gtggcgttgg ctgctagagc gatgccgggc 60 
cggagttgcg tcgccttagt cctcctggct gccgccgtca gctgtgccgt cgcgcagcac 120 
gcgccgccgg tgagtgagct tgagccgagg cgcagagagg ggcgtgcagt ggacagagga 180 
ctgcagaaaa tcaacctatc ctccttcagg accaacgtac agaggtgcag ttccatggta 240 
caccataaat cttgacttac caccctacaa aagatggcat gaattgatgc ttgacaaggc 300 
accaatgcta aaggttatag tgaattctct gaagaatatg ataaatacat tcgtgccaag 360 
tggaaaagtt atgcaggtgg tggatgaaaa attgcctggc ctacttggca actttcctgg 420 
cccttttgaa gaggaaatga agggtattgc cgctgttact gatatacctt taggagagat 480 
tatttcattc aatatttttt atgaattatt taccatttgt acttcaatag tagcagaaga 540 
caaaaaaggt catctaatac atgggagaaa catggatttt ggagtatttc ttgggtggaa 600 
cataaataat gatacctggg tcataactga gcaactaaaa cctttaacag tgaatttgga 660 
tttccaaaga aacaacaaaa ctgtcttcaa ggcttcaagc tttgctggct atgtgggcat 720 
gttaacagga ttcaaaccag gactgttcag tcttacactg aatgaacgtt tcagtataaa 780 
tggtggttat ctgggtattc tagaatggat tctgggaaag aaagatgcca tgtggatagg 840 
gttcctcact agaacagttc tggaaaatag cacaagttat gaagaagcca agaatttatt 900 
gaccaagacc aagatattgg ccccagccta ctttatcctg ggaggcaacc agtctgggga 960 
aggttgtgtg attacacgag acagaaagga atcattggat gtatatgaac tcgatgctaa 1020 
gcagggtaga tggtatgtgg tacaaacaaa ttatgaccgt tggaaacatc ccttcttcct 1080 
tgatgatcgc agaacgcctg caaagatgtg tctgaaccgc accagccaag agaatatctc 1140 
atttgaaacc atgtatgatg tcctgtcaac aaaacctgtc ctcaacaagc tgaccgtata 1200 
cacaaccttg atagatgtta ccaaaggtca attcgaaact tacctgcggg actgccctga 1260 
cccttgtata ggttggtgag cacacgtctg gcctacagaa tgcggcctct gagacatgaa 1320 
gacaccatct ccatgtgacc gaacactgca gctgtctgac cttccaaaga ctaagactcg 1380 
cggcaggttc tctttgagtc aaaagcttgt cttcgtccat ctgttgacaa atgacagatc 1440 
tttttttttt ccccctatca gttgattttt cttatttaca gataacttct ttaggggaag 1500 
taaaacagtc atctagaatt cactgagttt ggtttcactt tgacatttgg ggatctggtg 1560 
ggcagtcgaa ccatggtgaa ctccacctcc gtggaataaa tggagattca gcgtgggtgt 1620 
tgaatccagc acgtctgtgt gagtaacggg acagtaaaca ctccacattc ttcagttttt 1680 
cacttctacc tacatatttg tatgtttttc tgtataacag ccttttcctt ctggttctaa 1740 
ctgctgttaa aattaatata tcattatctt tgctgttatt gacagcgata taattttatt 1800 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2268 
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acatatgatt agagggatg 

<210> 95 
<211> 1711 
<212> UNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 6486485CB1 

<400> 95 

aaactatttg agggtagggg ctgtgattat 
aggttggtgc tttgtaggca cccagggact 
agaaggatgg gtccatagta ggacctggtg 
caatccctct cctgggtcag ctcccagggg 
gaggtcattg tgcacaggaa agctctctcc 
gaaatcaagc cagaattcaa caaagtctca 
ttcgagtaga agatctcctt caagggaaag 
tccatgtatc atcccccaag taaacacttc 
agtccctggc ctgtaagtcc ctacaatgat 
caggaggatg ccagaggagc tcagccaaca 
cttgggcact acacagatgc ccttgagctc 
attaagacag aacttaaaaa caatagattg 
ccacaatttt catttatttt taaatgtttc 
gcagtttgtg tatttgttga caactttatg 
tttttccgag accgtctttt ctcagaggct 
ctttcagtgg cagaagaggc cacatctgct 
ctggtgtctc ctcctgatct gggcccaggg 
gatgacaggc acaatagaaa caacggggaa 
cttacaatgt cacctctcct ccaccacggc 
ggaccagctt ctggccattt gtaatgctga 
ggatcgagtg gccccaggtc ccggcctggg 
tacaggggag tacttctgca tctatcacac 
cttcctggag gtcctagaaa gctcagtggc 
gcttggagcc atggccgcga cgctggtggt 
gttgactaga aagaagaaag ccctcagaat 
atcagctgga caggaggaat ggagccccag 
ggcagaagct gcacctgctg ggctctgtgg 
gcatgactac ttcaatgtcc tgagttacag 
gactggttag caaccagagg catcttctgg 

<210> 96 
<211> 5096 
<212> DMA 

<213> Homo sapiens 
<220> 

<221> misc_£eature 

<223> Incyte ID No: 7503772CB1 

<400> 96 

gggaacccag aaccagcccg agccgcctgc 
gcgcggttgg tcctcgcccc ctcccgcccg 
gaccgcgggc tggccgcgct ctggtgcctt 
ggcacgcact accgctacct ctggaggggc 



1819 



ttactctcat atcctcagag cctggtgttg 60 
ttcaaatgaa tgaagggagg gagggaggaa 120 
atgggctggg agctccaggc aaatgtcaac 180 
ctcacccttc tttgcatttc cagctctcat 240 
tctaatctcc tctgatccta ctgcaccaga 300 
gtccagataa acaagacaaa agaaataaga 360 
ttgctgtgtt tgtccaagac ctttgtccca 420 
ttgttcacct gttcattaga tttcaagtgc 480 
aagtttctct tatcattgca cat tct teat 540 
gttcctcatc agtagcagat tcttcagaat 600 
tttgaataaa ggctgatttt tagaaaaaac 660 
actataatcc aaagacgagt gtacctctaa 720 
cttcatggcc tttcttgtgg ctcaccctat 780 
tgtttttaat atggtttttg ccaaacttgg 840 
cagttttacc gtcctatctg cagteggcta 900 
tcctgtaggc cctctgggca gaagcatgcg 960 
gctgaggcag gctcccctcg cctcaggaat 1020 
catttctgea gagaaaggtg gctctatcat 1080 
acaagtgacc caggtcaact gggagcagca 1140 
cttggggtgg cacatctccc catccttcaa 1200 
cctcaccctc cagtegctga ccgtgaacga 1260 
ctaccctgat gggaegtaca ctgggagaat 1320 
tgagcacggt gccaggttcc agattccatt 1380 
catctgcaca gcagtcatcg tggtggtcgc 1440 
ccattctgtg gaaggtgacc tcaggagaaa 1500 
tgctccctca cccccaggaa gctgtgtcca 1560 
agagcagegg ggagaggact gtgecgaget 1620 
aagcctgggt aactgeaget tcttcacaga 1680 
a 1711 



cccgtcgccc ggccgccggc ttagggegea 60 
ccggcctacc aggccatggg ggcgtcccgg 120 
gggctcctgg ggggcctggc gcgcgtcgcg 180 
tgctacccat gtcacctggg ccaggccggc 240 
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taccccgtga gcgccggtga ccagaggcca gatgtggacg aatgccgaac ccacaacggt 300 
ggctgccagc accggtgcgt gaacacccca ggctcctacc tctgtgagtg caagcccggc 360 
ttccggctcc acactgacag caggacctgc ctggccatta actcctgcgc cctgggcaat 420 
ggcggctgcc agcaccactg tgtccagctc acaatcactc ggcatcgctg ccagtgccgg 480 
cccgggttcc agctccagga ggacggcagg cattgtgtcc gtagaagccc gtgtgccaac 540 
aggaacggca gctgcatgca caggtgccag gtggtccggg gcctcgcccg ctgtgagtgc 600 
cacgtgggct atcagctagc agcggacggc aaggcctgtg aagatgtgga cgaatgtgcc 660 
gcagggctgg cccagtgtgc ccatggctgc ctcaacaccc aggggtcctt caagtgcgtg 720 
tgtcacgcgg gctatgagct gggagccgat ggccggcagt gctaccggat tgagatggaa 780 
atcgtgaaca gctgtgaggc caacaacggc ggctgctccc atggctgcag ccacaccagt 840 
gctgggcccc tgtgcacatg tccccgcggc tacgagctgg acacagatca gaggacctgc 900 
atcgatgtcg acgactgtgc agacagcccg tgctgccagc aggtgtgcac caacaaccct 960 
ggcgggtacg agtgcggctg ctacgccggc taccggctca gtgccgatgg ctgcggctgt 1020 
gaggatgtgg atgagtgcgc ctccagccgt ggcggctgcg agcaccactg caccaacctg 1080 
gccggctcct tccagtgctc ctgcgaggcc ggctaccggc tgcacgagga ccgtaggggc 1140 
tgcagccccc tggaggagcc gatggtggac ctggacggcg agctgccttt cgtgcggccc 1200 
ctgccccaca ttgccgtgct ccaggacgag etgccgcaac tcttccagga tgacgacgtc 1260 
ggggccgatg aggaagaggc agagttgcgg ggcgaacaca cgctcacaga gaagtttgtc 1320 
tgcctggatg actcctttgg ccatgactgc agcttgacct gtgatgactg caggaacgga 1380 
gggacctgcc tcctgggcct ggatggctgt gattgccccg agggctggac tgggctcatc 1440 
tgcaatgaga cttgtcctcc ggacacctct gggaagaact gcagcttctc ctgcagctgt 1500 
cagaatggtg ggacctgcga ctctgtcacg ggggcctgcc gctgcccccc gggtgtcagt 1560 
ggaactaact gtgaggatgg ctgccccaag ggctactatg gcaagcactg tcgcaagaaa 1620 
tgcaactgtg ccaaccgggg ccggtgccac cgcctctacg gggcctgcct ctgcgaccca 1680 
gggctctacg gccgcttctg ccacctcacc tgcccgccgt gggcctttgg gccgggctgc 1740 
tcggaggagt gccagtgtgt gcagccccac acgcagtcct gtgacaagag ggatggcagc 1800 
tgctcctgca aggctggctt ccggggcgag cgctgtcagg cagagtgtga gctgggctac 1860 
tttgggccgg ggtgctggca ggcatgcacc tgcccagtgg gcgtggcctg tgactccgtg 1920 
agcggcgagt gtgggaagcg gtgtcctgct ggcttccagg gagaggactg tggccaagag 1980 
tgcccggtgg ggacgtttgg cgtgaactgc tcgagctcct gctcctgtgg gggggccccc 2040 
tgccacgggg tcacggggca gtgccggtgt ccgccgggga ggactgggga agactgtgag 2100 
gcagattgtc ccgagggccg ctgggggctg ggctgccagg agatctgccc agcatgccag 2160 
cacgctgccc gctgcgaccc tgagaccgga gcctgcctgt gcctccctgg cttcgtcggc 2220 
agccgctgcc aggacgtgtg cccagcaggc tggtatggtc ccagctgcca gacaaggtgc 2280 
tcttgtgcca atgatgggca ctgccaccca gccaccggac actgcagctg tgcccccggg 2340 
tggaccggct ttagctgcca gagagcctgt gatactgggc actggggacc tgactgcagc 2400 
cacccctgca actgcagcgc tggccacggg agctgtgatg ccatcagcgg cctgtgtctg 2460 
tgtgaggctg gctacgtggg cccgcggtgc gagcagcagt gtccccaggg ccactttggg 2520 
cccggctgtg agcagctgtg ccagtgtcag catggagcag cctgtgacca cgtcagcggg 2580 
gcctgcacct gcccggccgg ctggaggggc accttctgcg agcatgcctg cccggccggc 2640 
ttctttggat tggactgtcg cagtgcctgc aactgcaccg ccggagctgc ctgtgatgcc 2700 
gtgaatggct cctgcctctg ccccgctggc cgccggggcc cccgctgtgc cgagacctgc 2760 
ccagcccaca cctacgggca caattgcagc caggcctgtg cctgctttaa cggggcctcc 2820 
tgtgaccctg tccacgggca gtgccactgt gcccctggct ggatggggcc ctcctgcctg 2880 
caggagtgcc tcccccggga cgtcagagct ggctgccggc acagcggcgg ttgcctcaac 2940 
gggggcctgt gcgacccgca cacgggccgc tgcctctgcc cagccggctg gactggggac 3000 
aagtgtcaga gcccctgcct gcggggctgg tttggagagg cctgtgccca gcgctgcagc 3060 
tgcccgcctg gcgctgcctg ccaccacgtc actggggcct gccgctgtcc ccctggcttc 3120 
actggctccg gctgcgagca gggatgtccg cccgggcggt atgggccagg ctgtgaacag 3180 
ctgtgtgggt gtctcaacgg gggctcctgt gatgcggcca cgggggcctg ccgctgcccc 3240 
actgggttcc tcgggacgga ctgcaacctc agtgagtggc tggtggcagc agtgctgagt 3300 
ggcgcctcct gtgtctccag gcctgtccgc agggccgctt cggccccaac tgcacccacg 3360 
tgtgtgggtg tgggcagggg gcggcctgcg accctgtgac cggcacctgc ctctgccccc 3420 
cggggagagc cggcgtccgc tgtgagcgag gctgccccca gaaccggttt ggcgtgggct 3480 
gcgagcacac ctgctcctgc agaaatgggg gcctgtgcca cgccagcaac ggcagctgct 3540 
cctgtggcct gggctggacg gggcggcact gcgagctggc ctgtccccct gggcgctacg 3600 
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gagccgcctg ccatctggag tgctcctgcc acaacaacag cacgtgtgag cctgccacgg 3660 
gcacctgccg ctgcggcccc ggcttctatg gccaggcctg cgagcacccc tgtccccctg 3720 
gcttccacgg ggctggctgc caggggttgt gctggtgtca acatggagcc ccctgcgacc 3780 
ccatcagtgg ccgatgcctc tgccctgccg gcttccacgg ccacttctgt gagagggggt 3840 
gtgagccagg ttcatttgga gagggctgcc accagcgctg tgactgtgac gggggggcac 3900 
cctgtgaccc tgtcaccggt ctctgccttt gcccaccagg gcgctcagga gccacctgta 3960 
acctggattg cagaaggggc cagtttgggc ccagctgcac cctgcactgt gactgcgggg 4020 
gtggggctga ctgcgaccct gtcagtgggc agtgtcactg tgtggatggc tacatggggc 4080 
ccacgtgccg ggaaggtggg cccctccggc tccccgagaa cccgtcctta gcccagggct 4140 
cagcgggcac actgcccgcc tccagcagac ccacatcccg gagcggtgga ccagcgaggc 4200 
actagtagag gcagtcccgt ggagcccgcc tctccagtcc cagccagagg ggaccctggc 4260 
ctttggtgac cactgagaag gacacttcac gggcccagag ctcctggtac tgcccttcct 4320 
ttgagggccg tggagggctg tggacagccc agcaacctgt cgctcttgga ggctggtgtg 4380 
gccttgagga gggaagcctc gcatggccgc tggaagagag gcgcctcctg gcctggctct 4440 
gcagaaccca ggggcacgct ctgggcctgg gctgaggaag tcccgctctc cccgcggctc 4500 
tgagtcggac tgaggacagg tgtgggcgcc agtgtgggtg caggcgcagg tgcaggcaca 4560 
gggccactgt cctccaggca ggctttttgg tgctaggccc tgggactgga agtcgcccag 4620 
cccgtattta tgtaaaggta tttatgggcc actgcacatg cccgctgcag ccctgggatc 4680 
agctggaagc tgcctgtcat ctcctgccca atccccagaa accctgattc aggtctgcag 4740 
gctcctgcgg gctcaccagg ctgctggctc cggtaccatg taaacctagg aaggtaaagg 4800 
agcaggcaac ctcctcgtgg cctgtgtgtt tgctgtgtta cgtggactct gtgtgggctc 4860 
ctccctgggg cccggccagc ataacggtgc acccagggac ctcccagtgc acccggggcc 4920 
ctttgcaggg gtgggggtgc cacacaagtg aagaagttgg gactcatctc agttcccagt 4980 
gctattgagg agaacgctgg ggctgcattc attaccgctg agacccagag actggctgtt 5040 
cccagagaat ggcccagggg gaggagggct ggtgtggaag ggcaacttgg actgag 5096 

<210> 97 
<211> 5049 
<212> UNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyto ID No: 7503773CB1 

<400> 97 

gggaacccag aaccagcccg agccgcctgc cccgtcgccc ggccgccggc ttagggcgca 60 
gcgcggttgg tcctcgcccc ctcccgcccg ccggcctacc aggccatggg ggcgtcccgg 120 
gaccgcgggc tggccgcgct ctggtgcctt gggctcctgg ggggcctggc gcgcgtcgcg 180 
ggcacgcact accgctacct ctggaggggc tgctacccat gtcacctggg ccaggccggc 240 
taccccgtga gcgccggtga ccagaggcca gatgtggacg aatgccgaac ccacaacggt 300 
ggctgccagc accggtgcgt gaacacccca ggctcctacc tctgtgagtg caagcccggc 360 
ttccggctcc acactgacag caggacctgc ctggccatta actcctgcgc cctgggcaat 420 
ggcggctgcc agcaccactg tgtccagctc acaatcactc ggcatcgctg ccagtgccgg 480 
cccgggttcc agctccagga ggacggcagg cattgtgtcc gtagaagccc gtgtgccaac 540 
aggaacggca gctgcatgca caggtgccag gtggtccggg gcctcgcccg ctgtgagtgc 600 
cacgtgggct atcagctagc agcggacggc aaggcctgtg aagatgtgga cgaatgtgcc 660 
gcagggctgg cccagtgtgc ccatggctgc ctcaacaccc aggggtcctt caagtgcgtg 720 
tgtcacgcgg gctatgagct gggagccgat ggccggcagt gctaccggat tgagatggaa 780 
atcgtgaaca gctgtgaggc caacaacggc ggctgctccc atggctgcag ccacaccagt 840 
gctgggcccc tgtgcacatg tccccgcggc tacgagctgg acacagatca gaggacctgc 900 
atcgatgtcg acgactgtgc agacagcccg tgctgccagc aggtgtgcac caacaaccct 960 
ggcgggtacg agtgcggctg ctacgccggc taccggctca gtgccgatgg ctgcggctgt 1020 
gaggatgtgg atgagtgcgc ctccagccgt ggcggctgcg agcaccactg caccaacctg 1080 
gccggctcct tccagtgctc ctgcgaggcc ggctaccggc tgcacgagga ccgtaggggc 1140 
tgcagccccc tggaggagcc gatggtggac ctggacggcg agctgccttt cgtgcggccc 1200 
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ctgccccaca ttgccgtgct ccaggacgag ctgccgcaac tcttccagga tgacgacgtc 1260 
ggggccgatg aggaagaggc agagttgcgg ggcgaacaca cgctcacaga gaagtttgtc 1320 
tgcctggatg actcctttgg ccatgactgc agcttgacct gtgatgactg caggaacgga 1380 
gggacctgcc tcctgggcct ggatggctgt gattgccccg agggctggac tgggctcatc 1440 
tgcaatgaga cttgtcctcc ggacaccttt gggaagaact gcagcttctc ctgcagctgt 1500 
cagaatggtg ggacctgcga ctctgtcacg ggggcctgcc gctgcccccc gggtgtcagt 1560 
ggaactaact gtgaggatgg ctgccccaag ggctactatg gcaagcactg tcgcaagaaa 1620 
tgcaactgtg ccaaccgggg ccggtgccac cgcctctacg gggcctgcct ctgcgaccca 1680 
gggctctacg gccgcttctg ccacctcacc tgcccgccgt gggcctttgg gccgggctgc 1740 
tcggaggagt gccagtgtgt gcagccccac acgcagtcct gtgacaagag ggatggcagc 1800 
tgctcctgca aggctggctt ccggggcgag cgctgtcagg cagagtgtga gctgggctac 1860 
tttgggccgg ggtgctggca ggcatgcacc tgcccagtgg gcgtggcctg tgactccgtg 1920 
agcggcgagt gtgggaagcg gtgtcctgct ggcttccagg gagaggactg tggccaagag 1980 
tgcccggtgg ggacgtttgg cgtgaactgc tcgagctcct gctcctgtgg gggggccccc 2040 
tgccacgggg tcacggggca gtgccggtgt ccgccgggga ggactgggga agactgtgag 2100 
gcagattgtc ccgagggccg ctgggggctg ggctgccagg agatctgccc agcatgccag 2160 
cacgctgccc gctgcgaccc tgagaccgga gcctgcctgt gccfcccctgg cttcgtcggc 2220 
agccgctgcc aggacgtgtg cccagcaggc tggtatggtc ccagctgcca gacaaggtgc 2280 
tcttgtgcca atgatgggca ctgccaccca gccaccggac actgcagctg tgcccccggg 2340 
tggaccggct ttagctgcca gagagcctgt gatactgggc actggggacc tgactgcagc 2400 
cacccctgca actgcagcgc tggccacggg agctgtgatg ccatcagcgg cctgtgtctg 2460 
tgtgaggctg gctacgtggg cccgcggtgc gagcagcagt gtccccaggg ccactttggg 2520 
cccggctgtg agcagctgtg ccagtgtcag catggagcag cctgtgacca cgtcagcggg 2580 
gcctgcacct gcccggccgg ctggaggggc accttctgcg agcatgcctg cccggccggc 2640 
ttctttggat tggactgtcg cagtgcctgc aactgcaccg ccggagctgc ctgtgatgcc 2700 
gtgaatggct cctgcctctg ccccgctggc cgccggggcc cccgctgtgc cgagacctgc 2760 
ccagcccaca cctacgggca caattgcagc caggcctgtg cctgctttaa cggggcctcc 2820 
tgtgaccctg tccacgggca gtgccactgt gcccctggct ggatggggcc ctcctgcctg 2880 
caggagtgcc tcccccggga cgtcagagct ggctgccggc acagcggcgg ttgcctcaac 2940 
gggggcctgt gtgacccgca cacgggccgc tgcctctgcc cagccggctg gactggggac 3000 
aagtgtcaga gcccctgcct gcggggctgg tttggagagg cctgtgccca gcgctgcagc 3060 
tgcccgcctg gcgctgcctg ccaccacgtc actggggcct gccgctgtcc ccctggcttc 3120 
actggctccg gctgcgagca ggcctgccca cccggcagct ttggggagga ctgtgcgcag 3180 
atgtgccagt gtcccggtga gaacccggcc tgccaccctg ccaccgggac ctgctcatgt 3240 
gctgctggct accacggccc cagctgccag caacgatgtc cgcccgggcg gtatgggcca 3300 
ggctgtgaac agctgtgtgg gtgtctcaac gggggctcct gtgatgcggc cacgggggcc 3360 
tgccgctgcc ccactgggtt cctcgggacg gactgcaacc tcacctgtcc gcagggccgc 3420 
ttcggcccca actgcaccca cgtgtgtggg tgtgggcagg gggcggcctg cgaccctgtg 3480 
accggcacct gcctctgccc cccggggaga gccggcgtcc gctgtgagcg aggctgcccc 3540 
cagaaccggt ttggcgtggg ctgcgagcac acctgctcct gcagaaatgg gggcctgtgc 3600 
cacgccagca acggcagctg ctcctgtggc ctgggctgga cggggcggca ctgcgagctg 3660 
gcctgtcccc ctgggcgcta cggagccgcc tgccatctgg agtgctcctg ccacaacaac 3720 
agcacgtgtg agcctgccac gggcacctgc cgctgcggcc ccggcttcta tggccaggcc 3780 
tgcgagcacc cctgtccccc tggcttccac ggggctggct gccaggggtt gtgctggtgt 3840 
caacatggag ccccctgcga ccccatcagt ggccgatgcc tctgccctgc cggcttccac 3900 
ggccacttct gtgagagggg gtgtgagcca ggttcatttg gagagggctg ccaccagcgc 3960 
tgtgactgtg acgggggggc accctgtgac cctgtcaccg gtctctgcct ttgcccacca 4020 
gggcgctcag gagccacctg taacctgggt gggcccctcc ggctccccga gaacccgtcc 4080 
ttagcccagg gctcagcggg cacactgccc gcctccagca gacccacatc ccggagcggt 4140 
ggaccagcga ggcactagta gaggcagtcc cgtggagccc gcctctccag tcccagccag 4200 
aggggaccct ggcctttggt gaccactgag aaggacactt cacgggccca gagctcctgg 4260 
tactgccctt cctttgaggg ccgtggaggg ctgtggacag cccagcaacc tgtcgctctt 4320 
ggaggctggt gtggccttga ggagggaagc ctcgcatggc cgctggaaga gaggcgcctc 4380 
ctggcctggc tctgcagaac ccaggggcac gctctgggcc tgggctgagg aagtcccgct 4440 
ctccccgcgg ctctgagttg gactgaggac aggtgtgggc gccagtgtgg gtgcaggcgc 4500 
aggtgcaggc acagggccac tgtcctccag gcaggctttt tggtgctagg ccctgggact 4560 
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ggaagtcgcc cagcccgtat 
cagccctggg atcagctgga 
ttcaggtctg caggctcctg 
aggaaggtaa aggagcaggc 
tctgtgtggg ctcctccctg 
tgcacccggg gccctttgca 
ctcagttccc agtgctattg 
gagactggct gttcccagag 
tggactgag 

<210> 98 
<211> 1086 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 7504698CB1 

<400> 98 

gcgggacttc ctgtgtcgta tttccaagga ctccaaagcg aggccgggga ctgaaggtgt 60 

gggtgtcgag ccctctggca gagggttaac ctgggtcaaa tgcacggatt ctcacctcgt 120 

acagttacgc tctcccgcgg cacgtccgcg aggacttgaa gtcctgagcg ctcaagtttg 180 

tccgtaggtc gagagaaggc catggaggtg ccgccaccgg caccgcggag ctttctctgt 240 

agagcattgt gcctatttcc ccgagtcttt gctgccgaag ctgtgactgc cgattcggaa 300 

gtccttgagg agcgtcagaa gcggcttccc tacgtcccag agccctatta cccggaatct 360 

ggatgggacc gcctccggga gctgtttggc aaagactgtc acgggaagtc tttttaggat 420 

aaacgtaggc ctgcgtggcc tggtggctgg tggcataatt ggagccttgc tgggcactcc 480 

tgtaggaggc ctgctgatgg catttcagaa gtactctggt gagactgttc aggaaagaaa 540 

acagaaggat cgaaaggcac tccatgagct aaaactggaa gagtggaaag gcagactaca 600 

agttactgag cacctccctg agaaaattga aagtagttta caggaagatg aacctgagaa 660 

tgatgctaag aaaattgaag cactgctaaa ccttcctaga aacccttcag taatagataa 720 

acaagacaag gactgaaagt gctctgaact tgaaactcac tggagagctg aagggagctg 780 

ccatgtccga tgaatgccaa cagacaggcc actctttggt cagcctgctg acaaatttaa 840 

gtgctggtac ctgtggtggc agtggcttgc tcttgtcttt ttcttttctt tttaactaag 900 

aatggggctg ttgtactctc actttactta tccttcaatt taaatacata cttatgtttg 960 

tattaatcta tcaatatatg catacatgaa tatatccacc cacctagatt ttaagcagta 1020 

aataaaacat ttcgcaaaag attaaagttg aattttacag ttaaaaaaaa aaaaaaaaaa 1080 

aaaact 1086 

<210> 99 

<211> 1099 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_£eature 

<223> Incyte ID No: 7510361CB1 

<400> 99 

caaggatttc atgagcatcc tcctctaaac gcgtgtcaag acaaaagatg cttcagcttt 60 
ggaaacttgt tctcctgtgc ggcgtgctca ctgggacctc agagtctctt cttgacaatc 120 
ttggcaatga cctaagcaat gtcgtggata agctggaacc tgttcttcac gagggacttg 180 
agacagttga caatactctt aaaggcatcc ttgagaaact gaaggtcgac ctaggagtgc 240 
ttcagaaatc cagtgcttgg caactggcca agcagaaggc ccaggaagct gagaaattgc 300 
tgaacaatgt catttctaag ctgcttccaa ctaacacgga catttttggg ttgaaaatca 360 



ttatgtaaag gtatttatgg 
agctgcctgt catctcctgc 
cgggctcacc aggctgctgg 
aacctcctcg tggcctgtgt 
gggcccggcc agcataacgg 
ggggtggggg tgccacacaa 
aggagaacgc tggggctgca 
aatggcccag ggggaggagg 



gccactgcac atgcccgctg 4620 
ccaatcccca gaaaccctga 4680 
ctccggtacc atgtaaacct 4740 
gtttgctgtg ttacgtggac 4800 
tgcacccagg gacctcccag 4860 
gtgaagaagt tgggactcat 4920 
ttcattaccg ctgagaccca 4980 
gctggtgtgg aagggcaact 5040 

5049 
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gcaactccct catcctggat gtcaaagctg 
tgagcttccc tgtcaccgcg aatgtcactg 
acctgaaagc ctccttggac ctcctgaccg 
accagcctgt tgccgtcctg ggagaatgcg 
tgctggacaa ccaaaaatgt attgaggctg 
cactttggga ggctgagaca cagccaaatc 
acgctgaaaa gcactgtatc ctccctgctg 
ttcatccact ccctggatgt gaatgtcatt 
acccagctgc aaaccctcat ctgaagagga 
ggtgaggagc cagtctctgt gccccaatgc 
cgtggcttcc cttattttcg tgttagaaga 
cctagccccc atcccaggct aggccctgat 
tcactgaact ctgattctg 

<210> 100 

<211> 1110 

<212> DNA 

<213> Homo sapiens 



aaccgatcga tgatggcaaa ggccttaacc 420 

tggccgggcc catcattggc cagattatca 480 

cagtcacaat tgaaactgat ccccagacac 540 

ccagtgaccc aaccagcatc tcactttcct 600 

ggcacgatgg ctccacaccg gtaatcccag 660 

atcaacaagt tcgtgaatag cgtgatcaac 720 

cagaaggaga tatgtccact gatccgcatc 780 

cagcaggtcg tcgataatcc tcagcacaaa 840 

cgaatgagga ggaccactgt ggtgcatgct 900 

acaggggcct atggtgaagt aaaagtcaag 960 

ctgtgccttc atctcagtca tagattgagc 1020 

tctggtcaca ctctgaacac tgagccttaa 1080 

1099 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7507013CB1 

<400> 100 

gcctggttac cgggagtggg gcgcccctcc 
tttcacagct ggctgtagct ggccaaggag 
acatttctgg gcaggtgtct ggaaaatatg 
atggcattaa ccacctgcct ggatacttca 
ctgggaagag gccagggagt attttgagga 
ctacatctac aatggcaaag gagggcgtgg 
gctgacaggt ggcatcctgc tcattgtcct 
ctggcgacag caccgaggcc agcagccctg 
gaggcctcac tgaagagctg ctttcgagac 
accggattcc ggaagccgct aggcctcata 
ggtgggagta ggggtcatcc ggcccgaggc 
tggatataca catgttttcg gcaacgtgtt 
actctcctga ccgtgagggc actggtcagt 
aaattcggac ccaggagccc agccccggct 
tgacagcccc ctccagtgcc acagggtacg 
tgtcttcgtg cactccccgt gcggtacagg 
ggcatatttg caagcgcgct cggtgcgggc 
tgagctgttc ccctaaataa aaaccttcgg 
aaaaaataat aaatgaaatg aactgcgaaa 



tccttatccc ctcccctctt ccctgtcccc 60 

ttctcgatta aagaggaagg ggcagtgctc 120 

aggggccacc cctctctgct gctgctatat 180 

cccagtgagg agacagacca agaggtgttc 240 

caacactctc acggagcgct tttgggagag 300 

acgagtggat gtggccagcc tggctgtggg 360 

ggccggcctg ggagcctttt ggtatctgcg 420 

tccccaagag gccgggctca ttagccctct 480 

ccggctctcc gaaccgtgcc cctgattcat 540 

gacgccgaag ctggacttgg agtggggaat 600 

ctgccctggc acacgcgttt ccgccgcgta 660 

cccgtgtcct ggcccctcac gggcccccac 720 

tccgcccccg tggtaggcag acgcgcgggg 780 

gtgccatctt gtgtatgggc agatatgacc 840 

cacacgcaga gccccgcctg tgcacacgcg 900 

ggcacttcgt aacccaggga aagggcgggg 960 

aggctcgcat tgcacccagg gagctggagt 1020 

aaaggagacc aaaaaaagca gaaataatgc 1080 

1110 



<210> 101 

<211> 3082 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mi sc_£ eature 

<223> Incyte ID No: 7510507CB1 



<400> 101 

gggagaacgc gagtgttagt taacatggct tggtgtgtgg ccacagctga cccggcgcat 60 
acctcccgtc ctctctttac agggctggcg gtgtctcggg gctcggctgg ccatgcctgg 120 
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agcgcggggt ttgactgggc cgccgttgtt gtagtgaccg ggaggcgctg ccgctctggg 180 
cagacggttc cgggagccgc acggtcccct ctccttcccc atcctctccc ctcccctctc 240 
cgggttcccc cacccacagg agccttgggc cgaccactcc cccgatggcc tcagccacgg 300 
aggaccccgt tctggagcgt tatttcaaag gccacaaagc tgcgatcacc tccttggacc 360 
tcagccccaa cggcaagcaa cttgctactg cttcttggga tacctttctc atgctatgga 420 
atttcaagcc acatgctaga gcttacagat atgtgggtca caaggatgtt gtaaccagcg 480 
tgcagttttc tccacatgga aacttattgg cgtctgcctc acgagacaga accgtgagac 540 
tctggattcc tgataagaga ggaaaattct cagaatttaa agctcataca gctccagttc 600 
gaagtgtaga cttttcagct gatggccagt ttctagctac agcttctgaa gacaaatcca 660 
taaaagtatg gagcatgtat cgccagcgct tcctgtattc cttgtatcga catacacact 720 
gggtacgctg tgccaaattt tcacccgatg gaagactaat tgtgtcatgt agtgaggata 780 
aaactattaa aatttgggat accacaaata agcaatgtgt taataacttc tcagatttcg 840 
ttggatttgc aaattttgtg gactttaacc ctagtggtac atgcatagct tcagcaggtt 900 
ctgatcaaac tgtgaaagtc tgggatgtaa gagtgaacaa attactacag cattaccaag 960 
ttcacagcgg tggagttaat tgcatatcat tccatccttc gggtaactat ctcatcacag 1020 
cttcttcaga tggtaccctt aagattctgg acctcttaga aggaaggctc atctatacac 1080 
ttcaaggaca tacgggacct gtctttactg tttcattttc aaaaggtgga gagctatttg 1140 
catcaggagg tgcagacaca caggtcttat tatggaggac taactttgat gaattgcatt 1200 
gtaaaggtct taccaaaaga aatctcaaaa gattacattt tgatt caeca ccacatcttc 1260 
ttgatatcta cccaagaaca ccacatcccc atgaggaaaa agttgagact gtagaaacaa 1320 
cagaaaccag tggtaggact ctgccagaca agggtgaaga ggcctgtgga tatttcttga 1380 
acccttcctt aatgtcacca gaatgtttgc caacaaccac gaaaaagaaa acagaagaca 1440 
tgagtgacct cccctgtgaa agtcaaagga gcatacctct cgctgtgact gatgetttag 1500 
agcatattat ggaacaactc aatgttttga cacagactgt ttcaatcttg gagcagegae 1560 
tgactttgac agaggataag ctgaaagact gecttgaaaa tcagcaaaag cttttcagtg 1620 
ctgtccaaca gaaaagctga ataaaaaatt cattttcatt tgttgggcag aggeccaata 1680 
aatgaacaaa tgtacataca ctcaggaagg tagtacaaga tactccatac aacacaacca 1740 
tgtgctattt atcatggcat ttcttaaaag ggtgagcaac agaacaaaag gcagaaaagg 1800 
catacctaag gactaattta aacacatatc aatgtgaagg actaatttaa attactatca 1860 
tttatgattg cagtaataaa gtgataagca ttcaagcaac tctgtatttt ccccatatta 1920 
ttttaaatgt ccat tt teat ttataggeca aatcctgcca ggaaagtaac cagatctctg 1980 
gatttcactg ttaagtcatt tcagattgac catattcaga cagtcatggg gtgaaataat 2040 
tcacttacct ccaaaatagc atcctatatg ccaataatga gttattgatc tgactagttg 2100 
tatgtctttc tgttcaaaat agaaattatc ctttcttact aatgccttga aagaatgaac 2160 
aaataaaaat tcccagacca cagaatttcc acagcaagaa tacacttatt ttaattaaca 2220 
atagcacaga tatagcatag ggcagtgggt tttttagtta atttatggcg tactttgttt 2280 
atccattggc caacctgaag gaaatgaaac tcacctatct ttctatcaca gatgaatgtg 2340 
ctagatgaat gatttgggtt gtatctgatc atggttcaca aaaattatgt tagtgtgttt 2400 
teagtatget aaaaagtcag agtgatacaa aagtgatatt taaaaatata cacacatgta 2460 
tgtacacatg cttcagaaat attgttgctt agggtgattt gggcagctaa atagtaagta 2520 
ctttttaaaa tttttgeate atcatcttcc tatttaatga attg tgatt t aaaacaaaat 2580 
gaaaataagc caggtattct aaaagatcct ggatacaaat taagaatttt gctttatttt 2640 
aaaccaaatt gagattaaat tgaagaaaag caagcaaatt aatttcagct tgattatcaa 2700 
cctgtatcaa gaacaaaaat gggaggaggt gtccacattt atggtgtgta taggtaacat 2760 
ggggaaaatg ctattctgtg ttttggaaaa gaagaaatag tgccgtccta tttatttcta 2820 
tatttagaaa tttttctcaa agaaatttca attgtatcta tgagatgggt ttctaagtat 2880 
cttattgtgt gttataagtg ccttttaata tcatactaag tgtgagcttc tggacatttt 2940 
caagagctta caaaaactaa gtggcattgt atttttataa ccccattgag aagactaagt 3000 
aagaaatgaa atgtcctatc aattttattt tgtcatgett caaacaataa agacatttct 3060 
gctttaaaaa aaaaaaaaaa gg 3082 

<210> 102 
<211> 1548 
<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No; 90106370CB1 

<400> 102 

cctctgcctc atttacttcc tcttctcacc 
agccgccctc tgggcgcgtg gaggggcccc 
tgccctgtgg gccctgcatc cccatcatgc 
tcctcaccac cgctgtgttg gctgaacgcc 
gccaccgtgc acccaccctg gtgtggcgcc 
gcaccgcccg agagcgctct gaggactggt 
gggcccctga gcctcccacc caggtgggca 
ccccctcagc cccaaattct gctcccagca 
cagcccggag caccttctgg gggccccagc 
tggtgagctg ggctgaaccc gagcagaggc 
aggccaggag gcagcggcca gggagcccgg 
ccttggagca gatctcagct ttctggaagc 
tccccagggt tccccagaga tccccgaggc 
aggatccagc attaggattg agactgcccc 
cctggagtcc ccctccccgg gtctgggtgg 
agcagtgcct ctgctcagct acctgggctg 
ccaggccaga agccttcaag atcgcatcca 
gaaatggcat cgaccagcca cctccacctt 
aaagagaacg ggtcctgcgg tgctgaacag 
atgacaggaa tccagtttcc cagagccaca 
gtgaggtcct ctaggaaaat acacaaagag 
ttcatatgaa ctgtggcttt gacccccaaa 
cagcggcact gtcttgctgg agtggggacc 
gcccattctc ctccatctgc acacgtctca 
aaggccatgg ccaggggaaa ctgagtttca 
cagggcaatc ccaaaatgtg acctctgatc 



cctgtgtctc tcagatgccg ttgactccag 60 
ccgcatggga agcagcccca tggccctcac 120 
tggtcctggc caccctggct gcgctcttca 180 
tgttccgccg tgctctccgc ccagacccca 240 
caggaggaga gctgtggatt gagcccatgg 300 
atggctctgc ggtccccctg ctgacagatc 360 
ctttggaggc ccgagcaaca gccccacctg 420 
acttgggccc ccagaccgta ctggaggtcc 480 
cctgggaggg gaggcccccc gccacaggcc 540 
cagaggccag cgtccagttt gggagccccc 600 
atcctgagtg gggcctccag ccacgggtca 660 
gtgaaggccg gaccagtgtg gggttctgaa 720 
aggccttgcc tcagtgggac cggggacccc 780 
agcgaagatg cccttcccag gctccttcca 840 
tggccaggct atgtggacta ggggaagccc 900 
tggctcagag acctgggggt ggagccaatg 960 
gatgaagaac ccaaggtact agatagtcag 1020 
ctttcagtgt ttaccgaagc caccaatacc 1080 
cctcggtgtg gcgatgacag ctggcaggag 1140 
aatcctgttc tccttggcca ctcacccact 1200 
aggaccagac caggcagagg aacattttgt 1260 
ctgcaaggag gaacttgctg ggccaagctg 1320 
tagagtcaga gaaaacccac aggctcctct 1380 
gcctcggacc ctcaccactc catggtgagg 1440 
tccaatgtgg agaggagcgt tgtcctagag 1500 
atcgtccttt ccagcttg 1548 
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